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(57) Abstract: The present invention 
relates to crystals of the Escherichia 
coli MurG, a membrane-associated 
UDP-slycosyl transferase involved 
in peptidoglycan biosynthesis. The 
present invention also relates to 
three-dimensional atomic coordinates 
of the MurG protein, three-dimen- 
sional structures of the protein, and 
images thereof. The present invention 
also relates to the atomic coordinates 
and three-dimensional structures 
of the a-carbon backbone and the 
a-carbon backbone and conserved 
amino acid residue sidechains 
of the MurG protein and images 
-4 thereof. The present invention further 

relates to three-dimensional atomic 

coordinates of the donor nucleotide binding site, the acceptor binding site^ and the membrane association site of the MurG protein, 
three-dimensional sturctures of the binding domains, and images thereof. The present invention also relates to computer readable 
media encoded with sets of the three dimensional coordinates described herein. The present invention relates to methods of 
crystallizing MurG proteins. The present invention relates to models of three dimensional structures of UDP-glycosyltransferases 
and, in particular, MurG proteins, based on the three dimensional structure dimensional structure of crystals of the Escherichia coli 
MurG. The present invention also relates to models of the three dimensional structures of the a-carbon backbone and the a-carbon 
backbone and conserved amino acid residue sidechains of UDP-glycosyltransferases and MurG proteins and of the binding sites 
thereof. The present invention also relates to methods of drug design using models of this invention, the compounds identified 
using models of the present invention that bind, inhibit or stimulate UDP-glycosyltransferases or MurG proteins, and compositions 
comprising compounds identified using the models of this invention for therapeutic or diagnositic uses. Also, the present invention 
relates to methods of making models of the present invention. 
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METHODS OF MAKING MODELS, METHODS OF USING MODELS OF 
MURG* COMPOUN DS THAT BIND, INHIBIT OR STIMULATE MURG 
PROTEINS, AND THERAPEUTIC COMPOSITIONS THEREOF 



FIELD OF THE INVENTION 

The present invention relates to crystals of the Escherichia coli MurG, a membrane- 
associated UDP-glycosyltransferase involved in peptidoglycan biosynthesis. The present 
invention also relates to three-dimensional atomic coordinates of the MurG protein, 
three-dimensional structures of the protein, and images thereof. The present invention 
also relates to the atomic coordinates and three-dimensional structures of the a-carbon 
backbone of the MurG protein and images thereof The present invention further relates 
to the atomic coordinates and three-dimensional structures of the a-carbon backbone and 
conserved amino acid residue sidechains of the MurG protein and images thereof The 
present invention further relates to three-dimensional atomic coordinates of the donor 
nucleotide binding site, the acceptor binding site, and the membrane association site of 
the MurG protein, three-dimensional structures of the binding domains; and images 
thereof. The present invention also relates to computer readable media encoded with sets 
of the three dimensional coordinates of the E. coli MurG protein, the a-carbon. backbone 
of the MurG protein, the a-carbon backbone and the conserved amino acid residue 
sidechains of the MurG protein, the donor nucleotide binding site, the acceptor binding 
site, and the membrane association site. The present invention relates to methods of 
crystallizing MurG proteins. 
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The present invention relates to models of three dimei^tonal structures of UDP- 
glycosyltransferases and, in particular, MurG proteins, based on the three dimensional 
structure of crystals of the Escherichia coli MurG. The present invention also relates to 
models of the three dimensional structures of the a-carb'on backbone of UDP- 
glycosyltransferases and MurG proteins. The present invention further relates to models 
of the three dimensional structure of the ct-carbon backbone and conserved amino acid 
residue sidechains of gUDP-glycosyltransferases, in particular, MurG proteins. The 
present invention further relates to models of the three-dimensional structures of donor 
nucleotide binding sites, acceptor binding sites, and membrane association sites of UDP- 
glycosyltransferases, in particular, MurG proteins. The present invention also relates to 
methods of drug design using models of this invention. The present invention further 
relates to compounds identified using models of the present invention that bind, inhibit or 
stimulate UDP-glycosyltransferases or MurG proteins. The present invention relates to 
compositions comprising compounds identified using the models of this invention for 
therapeutic or diagnositic uses. Also, the present invention relates to methods of making 
models of the present invention. 

BACKGROUND OF THE INVENTION 

The increasing frequency of resistance to existing antibiotics represents a serious 
public health threat. Structural and mechanistic information on essential bacterial 
enzymes could lead to the development of antibiotics that are active against resistant 
microorganisms. Both gram positive and gram negative bacterial cells are surrounded by 
a cross-linked carbohydrate polymer, peptidoglycan, which protects them from rupturing 
under high osmotic pressures. Many of the best antibiotics function by inhibiting 
peptidoglycan synthesis, which ultimately causes cell lysis. In recent years, intense 
effort has been focused on determining the structures of the enzymes that synthesize 
peptidoglycan. Structures of several of the early enzymes in the biosynthetic pathway 
have been reported (Benson et al., 1995; Bertrand et al.. 1997; Fan et al., 1994; 
Skarzynski et al., 1996); however, the later enzymes have proven more difficult to study 
because both they and their substrates are membrane-associated. 

MurG is the last enzyme involved in the intracellular phase of peptidoglycan 
synthesis (Bugg & Walsh, 1993). It catalyzes the transfer of N-acetyl glucosamine 
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(NAG) from IB to the C4 hydroxyl of a lipid-linked »tylmuramoyl pentapeptide" 
(NAM) to form a Minked NAG-NAM disaccharide that is transported across the cell 
membrane where it is polymerized and cross-linked (Fig. I). In bacterial cells MurG 
associates with the cytoplasmic surface of the membrane (Bupp & van Heijenoort, 1993). 
However, we have found that E. coli MurG can be solubilized at high concentrations in 

active form (Ha et al., 1999). 

The elucidation of the protein structure of a MurG protein is of importance in the 
identification and formulation of anti-bacterial agents. Until the discovery of the present 
invention, the structure and resulting mechanism by which MurG functions was not 
known. Thus, despite the important role of MurG in peptidoglycan synthesis, 
development of useful agents for treatment or diagnosis of disease was hindered by lack 
of structural information of the protein. 

In order to obtain structural information on a MurG protein, it is important to 
have purified, active enzyme. The demonstration of activity requires a suitable assay, 
which in turn requires access to the natural substrates or analogues thereof. The study of 
MurG was hampered by difficulties obtaining and handling the lipid-linked NAM 
substrate (commonly known as Lipid I). This problem was overcome by Walker and 
coworkers, who developed a synthetic route to a set of substrate analogues of Lipid I that 
were shown to function as glycosyl acceptor, in a glycosyl transfer reaction catalyzed by 
MurG Some of these substrate analogues are freely water soluble, making it possible' to 
monitor the activity of purified E. coli MurG in buffer in the absence of natural or 
* artificial membranes or detergents. 

The linear nucleic acid and amino acid sequences of E. coli MurG were reported 
1992. Subsequently, the nucleic acid and amino acid sequence of B. subtilus MurG 
.ported. Since then, many bacterial genomes have been sequenced and the 
information has been deposited in databases. Information based only on linear 
sequences, however, cannot accurately predict the three-dimensional structure of the 
protein and its functional domains. 

Therefore, there is a need in the art to elucidate the three-dimensional structure of 
a MurG protein. One three dimensional structure of a MurG-protetn can be used to 
construct models of other MurG proteins and to facilitate the structure determinate of 
: crystalline forms of other MurG proteins. Structures and mode.s of MurG proteins can 



in 

was ret 



WO 01/90301 PCT/USO 1/1 1500 

also be used to de^n proteins containing only the donor bn^Kg site or the acceptor " 
binding site. These proteins can be used in assays, including NMR-based assays, to 
identify - or characterize the mode of binding of - ligands that bind in or near the 
vicinity of the substrates. These ligands or compounds can then be used as leads for the 
design of inhibitors that have therapeutic activity. Structures and models of MurG 
proteins can also be used in computer-based drug design. 

SUMMARY OF THE INVENTION 
The present invention relates to crystalline Escherichia coli MurG protein. 
Obtaining such crystals is an unexpected result. It is well known in the protein 
crystallographic art that obtaining crystals of quality sufficient for determining the 
structure of a protein 'is unpredictable. In particular, obtaining crystals of quality 
sufficient for determining the three-dimensional (3-D) structure of MurG has not been 
achievable until the crystallization of MurG as disclosed in the present application. As 
such, determination of the three-dimensional structure of MurG has not been possible 
until the discovery of the present invention. Additionally, until the discovery of the 
present invention, derivation of the three-dimensional structure and models of other 
MurG proteins has not been possible. The present inventors are also the first to define 
the three-dimensional structure and provide three-dimensional models for drug design for 
MurG proteins. 

Accordingly, one object of the present invention is to provide crystals of 
sufficient quality to obtain a determination of the three-dimensional atomic coordinates 
and structures of MurG to high resolution, preferably to the resolution of less than 2.0 
angstroms (A). The present invention also provides methods for producing crystalline 
MurG protein. 

The value of the crystals of E. coli MurG protein extends beyond merely being 
able to obtain such crystals. The knowledge obtained concerning the MurG crystal 
structure, for example, has been used by the present inventors to define the heretofore 
unknown tertiary structure of the MurG protein and to identify the location of the 
glycosyl donor and glycosyl acceptor binding domains, as well as the location of the 
amino acid residues that are invariant in all MurG proteins. This information can be used 
to design inhibitors of MurG that have therapeutic utility. The atomic coordinates of E. 
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co/i MurG also are used to model the heretofore unknown tertiary structures of other 
MurG proteins having substantially related linear amino acid sequences, such as for 
MurG proteins from other microorganisms. It is anticipated that homology models can 
be constructed even from amino acid sequences with relatively low homology because 
the present inventors have identified the location of the invariant amino acid residues in 
MurG. The relative spatial orientations of such residues is expected to be conserved in 
all MurG proteins. 

Comparison of nucleic acid and amino acid sequences of MurG proteins indicates 
that the linear amino acid sequences can vary significantly. Homology between MurG 
proteins from different microorganisms varies from less than 30% to greater than 90%, 
reflecting the evolutionary relationship between the organisms. The low homology . 
between distantly related MurG homologues is not believed to reflect significantly 
different folded structures. It is well known that many amino acid sequences are capable 
of adopting the same general fold. E. coli MurG contains an alpha/beta folding pattern, 
one of the most common folds known in proteins. It is likely that all MurG homologues 
contain a similar alpha/beta fold despite the differences in the linear amino acid 
sequences. What gives these proteins their identity is not the general fold, but the 
specific details - i.e., the presentation of certain amino acids on the folded structure. The 
present inventors have identified the location in E. coli MurG of a set of residues that are 
invariant in all MurG homologues. It is to be expected that these residues would adopt a 
similar, spatial location with respect to the folded structure in all MurG homologues. 
Therefore, these invariant residues, which have been selected by evolution as the critical 
residues for the binding and catalytic function of the protein, provide essential 
information on the location of the active site and on critical contacts to the 
substrates/products. They also serve as constraints that make it possible to predict the 
three-dimensional structures even of distantly related MurG homologues. Thus, 
knowledge of the three-dimensional structure of the E. coli MurG protein has provided a 
starting point for. investigation into the structure of all MurG proteins. 

Accordingly, a object of the present invention is to provide information regarding 
the atomic coordinates and three-dimensional structures of (1) the MurG protein, (2) the 
a-carbon backbone of the MurG protein, (3) the a-carbon backbone and conserved 
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amino acid residues of the MurG protein, (4) the donor nucleoTOe binding site, (5) the 
acceptor binding site, and (6) the membrane association site MurG proteins. 

It is also an object of this invention to solve the three-dimensional structure of 
UDP-glycosyltransferases, in particular target MURG proteins, and to determine their 
structure and/or atomic coordinates. Further, it is an object of this invention to use the 
structure or atomic coordinates of the E. coli MurG crystal to solve the structure of 
different MURG protein crystals, or a crystal of a mutant protein, homolog or co- 
complex of MurG. 

The present invention relates to models of three dimensional structures of UDP- 
glycosyltransferases, in particular MurG proteins, based on the atomic coordinates of 
crystalline E. coli MurG protein. 

It is a further object of this invention to provide UDP-glycosyltransferase 
enzyme mutants characterized by one or more different properties as compared with 
wild-type MURG. These properties include altered surface charge, increased stability to 
subunit dissociation, altered substrate specificity or higher specific activity. MURG 
mutants are useful to identify those amino acids that are most important for the 
enzymatic activity of MURG. This information, in turn, allows the design of improved 
inhibitors of MURG as compared with peptidic MURG inhibitors. 

Another object of the present invention is to provide computer readable mediums 
encoded with a set of three-dimensional coordinates of the E. coli MurG protein, the ct- 
carbon backbone of the MurG protein, the a-carbon backbone and conserved amino acid 
residues of the MurG protein, and the nucleotide donor binding site, the acceptor binding 
site, the membrane association site of the MurG protein. 

Another embodiment of the present invention provides three-dimensional and 
two-dimensional computer images of the three dimensional structure of MurG protein, 
the a-carbon backbone of the MurG protein, the a-carbon backbone and conserved 
amino acid residues of the MurG protein, and the nucleotide donor binding site, the 
acceptor binding site, the membrane association site of the MurG protein. 

The knowledge of the three dimensional structure of MurG also provides a means 
for designing proteins that have altered beneficial functions by analyzing the structure 
and interactions between individual amino acids of the protein. For example, the present 
inventors have shown that E. coli MurG consists of two domains separated by a cleft. 
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Noncovalent interactions between the two domains are not extensive. The present 
inventors have shown that the domains fold independently and can, therefore, be 
expressed independently either alone or as part of a recombinant protein containing the 
acceptor binding site from one MurG homologue and the donor binding site from another 
MurG homologue. It would be expected that the domains of other MurG proteins could 
also be expressed independently, either alone or as chimaeras with other MurG domains. 
Independently expressed domains of the protein are useful for discovering ligands that 

bind to the individual domains. 

The knowledge of the three-dimensional structure of E. coli MurG protein and 
models of other MurG proteins also provides a means for designing and producing 
compounds that regulate, inhibit or antagonize functions of the MurG protein (i.e., 
structure based drug design). For example, chemical compounds can be designed to 
block binding of UDP-GlcNAc to a MurG protein using various computer programs and 
models. 

It is also an object of this invention to use the structure coordinates and atomic 
details of MURG, or its mutants or homologues or co-complexes, to design, evaluate 
computationally, synthesize and use inhibitors of MURG that avoid the undesirable 
physical and pharmacologic properties of peptidic MURG inhibitors. . 
Another embodiment of the present invention is a composition comprising MurG protein 
in a crystalline form. 

Yet another embodiment of the present invention is a method for producing 
crystals of MurG, comprising combining MurG protein in a suitable buffer with a 
suitable amount of a reservoir buffer containing a detergent, and inducing crystal 
formation to produce said MurG crystals. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Fig. 1 . Pathway for peptidoglycan biosynthesis. 

Fig. 2. Overall architecture of MurG. A. Stereo view of the MurG structure. The N 
domain is shown in purple; the C domain is shown in green. The f.gure was generated 
with the programs MOLSCRIPT (Klaulis, 1991) and RASTER3D (Merrit & Murphy, 
1994). B. Topology diagram of MurG. 
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Fig. 3. Identification of critical residues in MurG and related glycosyltransferases. A. 
Sequence alignment of E. coli MurG with homologs from seven other bacterial strains, 
deliberately chosen to represent a disparate group of organisms. The secondary structure 
of £. coli MurG is shown above the sequences. Gaps mapping to the loop regions of E. 
coli MurG suggest that some sequences include other structural elements. Residues 
highlighted in blue are invariant among the eighteen MurG sequences available. 
Residues highlighted in yellow are identical in 85% of the eighteen homologs, while in 
the remaining 15%, only closely related amino acid substitutions are found. Highly 
conserved residues that do not meet the stringent criteria established for highlighting are 
shown in the consensus sequence. A consensus motif for UDP-giucuronosyltransferases 
is also shown. Numbering is with respect to the overexpressed E. coli MurG construct, 
which contains an additional N-terminal methionine. B. Mapping of the G loops and 
other highlighted residues from Fig. 3a in red on the MurG structure. Side chains for 
highly conserved residues are also shown. C. Model for the proposed UDP-binding 
subdomain found in many UDP-glycosyltransferases based on the E. coli MurG 
structure. Conserved residues in UDP-glucuronosy (transferases are highlighted in red. 
Side chains are shown for residues that are located near the cleft and may be involved in 
substrate binding. The glutamate residue is proposed to interact with the ribose sugar. 
The dotted loop varies in length within the MurG family and in other UDP-sugar 
transferases, but the N and Q on the following helix are invariant. Note that the UDP- 
glucuronosyltransferases contain a conserved D preceding the Q, which is not shown on 
this model. 

Fig 4. Structural analysis of the substrate binding pockets in MurG. A.- Structural 
comparison between the C-terminal domain of phage T4 ?-glucosyltransferase (left) and 
the C-terminal domain of E. coli MurG (right). The aligned six ?-strands are magenta, 
the aligned ?-helices are orange, and the other structural elements are blue. In ?- 
glucosyltransferase, key residues involved in UDP binding are highlighted in yellow. 
The analogous residues in MurG are also highlighted in yellow. B. A close-up view of 
the proposed donor binding pocket in the MurG C domain with the docked UDP- 
GlcNAc. Conserved residues in MurG are colored magenta. The carbonyl oxygen of 
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residue I245™Wiown in red, and its backbone nitrogen is^P^n in blue. C The surface - 
of E. coli MurG. The G loops and other conserved residues in MurG are colored 
magenta. The proposed membrane binding interface is also highlighted with 
hydrophobic residues in yellow and positively charged residues in blue. 

DEFINITIONS 

It is to be noted that the term "a" or "an" entity refers to one or more of that 
entity; for example, a compound refers to one or more compounds or at least one 
compound. As such, the terms "a" (or "an"), "one or more", and "at least one" can be 
used interchangeably herein. 

It is also to be noted that the terms "comprising", "including" and; "having" can 
be used interchangeably. Furthermore, a compound "selected from the group consisting 
of refers to one or more of the compounds in the list that follows, including mixtures 
(i.e., combinations) of two or more of the compounds. 

According to the present invention, an isolated, or pure, protein, is a protein that 
has been removed form its natural milieu. As such, "isolated" and "biologically pure" do 
not necessarily reflect. the extent to which the protein has been purified, An isolated 
protein of the-present invention can be obtained from its natural source, can be produced 
■ using recombinant DNA technology or can be produced by chemical synthesis. 

It is also to be noted that the terms "tertiary" and "three dimensional" can be used 
interchangeably. 

It is also to be noted that reference to a "MurG protein" can also be recited as 
"MurG" and such terms can be used to refer to the complete MurG protein, a portion of 
the MurG protein, such as a polypeptide. 

The following terms are also used herein: 

The term "naturally occurring amino acids" means the L-isomers of the naturally 
occurring amino acids. The naturally occurring amino acids are glycine, alanine, valine, 
leucine, isoleucine, serine, methionine, threonine, phenylalanine, tyrosine, tryptophan, 
cysteine, proline, histidine, aspartic acid, asparagine, glutamic acid, glutamine, .'gamma, 
carboxyglutamic acid,, arginine, ornithine and lysine. Unless specifically indicated, all 
amino acids referred to in this application are in the L-form. 
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The term "unnatural amino acids" means amino acids UWare not naturally found 
in proteins. Examples of unnatural amino acids used herein, include racemic mixtures of 
selenocysteine and selenomethionine. In addition, unnatural amino acids include the D or 
L forms of nor-leucine, para-nitrophenylalanine, homophenylalanine, para- 
fluorophenylalanine, 3-amino-p2-benzylpropionic acid, homoarginine, and D- 
phenylalanine. 

The term "positively charged amino acid" includes any naturally occurring or 
unnatural amino acid having a positively charged side chain under normal physiological 
conditions. Examples of positively charged naturally occurring amino acids are arginine, 
lysine and histidine. 

The term "negatively charged amino acid" includes any naturally occurring or 
unnatural amino acid having a negatively charged side chain under normal physiological 
conditions. Examples of negatively charged naturally occurring amino acids are aspartic 
acid and glutamic acid. 

The term "hydrophobic amino acid" means any amino acid having an uncharged, 
nonpolar side chain that is relatively insoluble in water. Examples of naturally occurring 
hydrophobic amino acids are alanine, leucine, isoleucine, valine, proline, phenylalanine, 

tryptophan and methionine. 

The term "hydrophilic amino acid" means any amino acid having an uncharged, 
polar side chain that is relatively soluble in water. Examples of naturally occurring 
hydrophilic amino acids are serine, threonine, tyrosine, asparagine, glutamine, and 
cysteine. 

The term "MurG" refers to a UDP-glycosyltransferase that has a two domain 
strucuture, where each domain contains a set of invariant residues as shown in Figure 3a, 
including any mutant, homologue or co-complex or any similar enzyme that catalyzes the 
transfer of N-acetylglucosamine (GlcNAc) from UDP to the C4 hydroxyl of the lipid- 
linked MurNAc pentapeptide. 

The term "mutant" refers to a MurG polypeptide, i.e.. a polypeptide displaying 
the biological activity of a wild-type MurG, characterized by the replacement of at least 
one amino acid from the wild-type, E. coli MURG sequence according to lkeda, et at, 
Nucleic Acids Res. 1990, and Mengin-LeCreuix et at.. Nucleic Adds Res. 1990. Such a 
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mutant may be prepared, for example, by expression of MURG cDNA previously altered 
in its coding sequence by PCR-based mutagenesis method. 

MurG mutants may also be generated by site-specific incorporation of unnatural 
amino acids into MURG proteins using the general biosynthetic method of Noren, C. J., 
et al., Science, 244, pp. 182-188 (1989). In this method, the codon encoding the amino 
acid of interest in wild-=type MURG is replaced by a "blank" nonsense codon, TAG, 
using oligonucleotide-directed mutagenesis (described in detail, infra). A suppressor 
tRNA directed against this codon is then chemically aminoacylated in vitro with the 
desired unnatural amino acid. The aminoacylated tRNA is then added to an in vitro 
translation system to yield a mutant MURG enzyme with the site-specific incorporated 

unnatural amino acid. 

Selenocysteine or selenomethionine may be incorporated into wild-type or mutant 
MURG by expression of MURG-encoding cDNAs in auxotrophic E. coli strains. 
Hendrickson, W. A. et al., EMBO J., 9(5), pp. 1665-1672 (1990). In this method, the 
wild-type or mutagenized MURG CDNA may be expressed in a host organism on a 
growth medium depleted of either natural cysteine or methionine (or both) but enriched 
in selenocysteine or selenomethionine (or both). 

The term "altered surface charge" means a change in one or more of the charge 
units of a mutant polypeptide, at physiological pH, as compared to wild-type MURG. 
This is preferably achieved by mutation of at least one amino acid of wild-type MURG -to 
an amino acid comprising a side chain with a different charge at physiological pH than 
the original wild-type side chain. 

The change in surface charge is determined by measuring the isoelectric point 
(pi) of the polypeptide molecule containing the substituted amino acid and comparing it 
to the isoelectric point of the wild-type MURG molecule. _ 

The term "altered substrate specificity" refers to a change in the ability of a 
mutant MURG to cleave a substrate as compared to wild-type MURG.. 

The "kinetic form" of MURG refers to the condition of the enzyme in its free or 
unbound fonn or bound to a chemical entity at either its active site or accessory binding 
site. 

A "competitive" inhibitor is one that inhibits MURG activity by binding to the 
same kinetic form, of MURG, as its substrate binds-thus directly competing with the 
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substrate for the aSve site of MURG. Competitive inhibition 



be reversed completely : * 
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by increasing the substrate concentration. 

An "uncompetitive" inhibitor is one that inhibits MURG by binding to a different 
kinetic form of the enzyme than does the substrate. Such inhibitors bind to MURG 
already bound with the substrate and not to the free enzyme. Uncompetitive inhibition 
cannot be reversed completely by increasing the substrate concentration. 

A "non-competitive" inhibitor is one that can bind to either the free or substrate 
bound form of MURG. 

Those of skill in the art may identify inhibitors as competitive, uncompetitive or 
non-competitive, by computer fitting enzyme kinetic data using standard equations 
according to Segel, I, H., Enzvme Kinetics, J. Wiley & Sons, (1975). It should also be 
understood that uncompetitive or non-competitive inhibitors according to this invention 
may bind to the accessory binding site. 

The term "homolog" means a protein having at least 25% amino acid sequence 
identity with MURG or any functional part of MURG, and including certain invariant 
amino acid residues corresponding to G14, G15, G18, H19, G104, H124, E125, G190, 
G191, S192, G194, A195, R261, G263, A264, E269, P281, Q289, N292 and A293 (as 
numbered in the E.coli MurG sequence set forth in Figure 3a) and also including three 
glycine rich loops. A homolog may contain some or all of the invariant residues. 

The term "co-complex" means MURG or a mutant or homologue of MURG in 
covalent or non-covalent association with a chemical entity or compound. 

The term "associating with" refers to a condition of proximity between a chemical 
entity or compound, or portions thereof, and a MurG molecule or portions thereof. The 
association may be non-covalent-wherein the juxtaposition is energetically favored by 
hydrogen bonding or van der Waals or electrostatic interactions-or it may be^ovalent. 

The term ".beta.-sheet" refers to the conformation of a polypeptide chain 
stretched into an extended zig-zig conformation. Portions of polypeptide chains that run 
"parallel" all run in the same direction. Polypeptide chains that are "antiparallel" run in 
the opposite direction from the parallel chains. 

The terms "atomic coordinates" or "structure coordinates" refer to mathematical 
coordinates derived from mathematical equations related to the patterns obtained on 
diffraction of a monochromatic beam of X-rays by the atoms (scattering centers) of a 
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MurG molecute in crystal form. The diffraction data are used to calculate an electron 
density map of the repeating unit of the crystal. The electron density maps are used to 
establish the positions of the individual atoms within the unit cell of the crystal. 

The term "heavy atom derivatization" refers to the method of producing a 
chemically modified form of a crystal of MURG. In practice, a MurG crystal is soaked in 
a solution containing heavy metal atom salts, or organometallic compounds, e.g., lead 
chloride, gold thiomalate, thimerosal, uranyl acetate or. mercuric chloride, which can 
diffuse through the crystal and bind to the surface of the protein. The location(s) of the 
bound heavy metal atom(s) can be determined by X-ray diffraction analysis of the soaked 
crystal. This information, in turn, is used to generate the phase information used to 
construct three-dimensional structure of the enzyme. Blundel,.T. L. and N. L. Johnson, 
Protein Crystallography, Academic Press (1 976). 

Those of skill in the art understand that a set of structure coordinates determined 
by X-ray crystallography is not without standard error. For the purpose of this invention, 
any set of structure coordinates for MURG or MURG homologues or MURG mutants 
that have a root mean square deviation of protein backbone atoms (N, C.alpha., C and O) 
of less than 0.75 A when superimposed-using backbone atoms-on the structure 
coordinates listed in Table 1 , Table 2 or Table 3 shall be considered identical. 

The term "unit cell" refers to a basic parallelepiped shaped block. The entjre 
volume of a crystal may be constructed by regular assembly of such blocks. Each unit 
cell comprises a complete representation of the unit of pattern, the repetition of which 
builds up the crystal. 

The term "space group" refers to the arrangement of symmetry elements of a 

crystal. • 

The term "molecular replacement" refers to a method that involves generating a 
preliminary model of a MurG crystal whose structure coordinates are. unknown, by 
' orienting and positioning a molecule whose structure coordinates are known (e.g., 
MURG coordinates from Table 1 . 2, or 3) within the unit cell of the unknown crystal so 
as best to account for the observed diffraction pattern of the unknown crystal. Phases can 
' then be calculated from this model and combined with the observed amplitudes to give 
"an approximate Fourier synthesis of the structure whose coordinates are unknown. This, 
in 'turn, can be subject to any of the several forms of refinement to provide a final, 
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accurate structure of the unknown crystal. Lattman, E., ^e of the Rotation and 
Translation Functions", in Methods in Enzymology, 115, pp. 55-77 (1985); M. G. 
Rossmann, ed., "The Molecular Replacement Method", Int. Sci. Rev. Ser., No. 13, 
Gordon & Breach, New York, (1972). Using the structure coordinates of MURG 
provided by this invention, molecular replacement may be used to determine the 
structure coordinates of a crystalline mutant or homologue of MURG or of a different 
crystal form of MURG. 



DETAILED DESCRIPTION OF TH E INVENTION 
The present invention relates to the discovery of the three-dimensional structure 
of the crystalline form of the E. coli MurG protein, models of such three-dimensional 
structures, a method of structure based drug design using such structures, methods to 
identify ligands or compounds that interact or bind with such structures, the compounds 
identified by such methods, and the use of such compounds in therapeutic compositions. 

More particularly, the present invention relates to novel crystals of E coli MurG 
protein, methods of production of such crystals, three dimensional coordinates of MurG 
protein, MurG structures and models derived from the E. coli MurG structure, and uses 
of such structures and models to derive other MurG structures and in ligand discovery 

and drug design strategies. 

The present invention also relates to three-dimensional structures and coordinates 
of the donor nucleotide binding site, the acceptor binding site, and the membrane 
association site of the MurG protein, structures and models of the binding sites, and uses 
of such structures and models to derive the binding sites of other MurG proteins and in 

drug design strategies. 

Solely for ease of explanation, the description of the invention is divided into the 
following sections: (1) crystals of MurG protein; (2) methods of crystallization; (3) three- 
dimens.onal crystal coordinates and structure of E. coli MurG; (4) three-dimensional 
coordinates and structure of the donor nucleotide binding site of MurG; (5) coordinates 
and structure of the acceptor binding site of MurG; (5) three dimensional coordinates and 
structure of the membrane association site; (6) two dimensional and three dimensional 
images of the protein, cc-carbon backbone, a-carbon backbone with conserved amino 
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acid residues^Kd binding sites; and (7) computer reaoWe mediums comprising the " ' 
three dimensional coordinates of the MurG protein, a-carbon backbone, a-carbon 
backbone with conserved amino acid residues, and binding sites; (8) images of structures 
of MurG protiensand binding sites; (9) models of MurG proteins and binding sites 
thereof and methods of using the structure of MurG to determine the structures of other 
MurG proteins and binding sites; (10) structure based drug design using models of MurG 
protein and binding site structures; (11) compounds derived from structure based drug 
design; and (12) therapeutic compositions using drugs designed from structure based 
drug design; 

CRYSTALS 

One embodiment of the present invention includes a composition comprising a 
MurG protein in a crystalline form (i.e., MurG crystals). As used herein, the terms 
(crystalline MurG" and "MurG crystal" both refer to crystallized MurG protein and are 
intended to be used interchangeably. More particularly, an embodiment of the present 
invention includes a composition comprising an E. coli MurG protein in a crystalline 
form. Preferably, a crystalline MurG is produced using the crystal formation method 
described herein; in particular according to the method disclosed in Example 1. A MurG 
crystal of the present invention comprises any crystal structure and preferably 
precipitates as a triclinic crystal. Preferably, a composition of the present invention 
includes MurG crystal molecules arranged in a crystalline manner in a PI space group 
with two molecules per assymmetric unit so as to form a unit cell of dimensions 
a=60.613 A, b=66.356 A, c=67.902 A, a=64.294, p=83.520, y-65,448. A preferred 
• crystal of the present invention provides X-ray diffraction data for determination of 
atomic coordinates to a resolution of about 3.0 A, preferably to about 2.4 A\ and more 

preferably to about 1 .8 A. 

Another embodiment of the present invention includes crystalline MurG protein 
' co-crystallized with a donor nucleotide or substrate or substrate analog. Preferably, a 
donor nucleotide is UDP or UDP-GlcNAc (UDP-N-acetylglucosamine) or an analog 
thereof. The substrate or substrate analog is preferably Lipid I or Upid IT, or analogs of 
Lipid I or Lipid II. More specifically, Lipid I and II analogs are as described in 
PCT/US99/02187, published- as . WQ99/38958 . and US Provisional Application Nos. 



15 



WO 01/90301 ^fe, PCT/USOl/11500 



60/122,966 filed March 3, 1999 and 60/137,696 filed June 4, 1999, and International 
Application No. PCT/USOO/05554 entitled "Bacterial transglycosylases: Assays for 
monitoring the activity using lipid II substrate analogs and methods for discovering 
antibiotics," all incorporated herein by reference in their entirety. 

Included in the present invention, a variety of MurG proteins from numerous 
organisms can be used to prepare MurG crystals, including but not limited to, 
microorganisms such as bacteria, h.gher-order bacteria, thermal stable bacteria, 
spirochetes, small pathogenic organisms, fungi, protozoa, cyanobacteria, and 
trypanosomes. More particularly, bacteria such as but not limited to, Escherichia coli. 
Bacillus subtilis, Aquefex aeolicus, Borrelia burgdorferi. Chlamydia pneumoniae. 
Chlamydia trachomatis. Enterococcus faecais, Enterococcus hirae. Haemophilus 
influenzae. Helicobacter pylori J99. Helicobacter pylori. Mycobacterium tuberculosis. 
Porphyromonas gingivalis, Rickettsia prawazekii.Streptomyces coelicolor. Streptomyces 
collinus. Streptococcus pneumoniae. Synechocystis sp. (strain PCC6803), Thermotoga 
maritime, and Treponema pallidum. 

In another embodiment of the present invention, the MurG proteins or fragments 
thereof, mutants or homologs are expressed in, for example, an E. coli host cell for use 
expressing sufficient quantities of sufficiently purified protein to form crystals. The 
present inventors have demonstrated that it is possible to express Enterococcus. faecalis 
MurG in E. coli cells - so the MurG proteins from many organisms can be cloned into 
expression vectors suitable for expression in E. coli cells. This would facilitate obtaining 
sufficient quanitites of isolated or purified MurG proteins. The expression of Erfaecalis 
MurG protein in E. coli host cells is performed, for example, by expressing the E. 
faecalis MurG gene cloned into a pET21b expression vector and transformed into an E. 
coli host cell. The MurG protein is over-expressed with a C-terminal his tag 
(LEHHHHHH) which allows the protein to be purified using a His-tag affinity column. 
The protein is then crystallized and the atomic coordinates are determined using X-ray 
diffraction and methods known to those skilled in the art. 

It is another embodiment of the present invention to provide for the construction 
and expression of chimeric MurG proteins to enable the crystallization and determination 
of the three dimensional coordinates of such chimeras. For example, if there are 
problems obtaining or crystallizing MurGs from other organisms, the present invention 
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provides in^mation that makes it possible to make cr^eric proteins containing the 
donor or acceptor binding site from E. coli MurG and the corresponding acceptor or 
donor binding site from another organism. Chimaeric proteins could be easier to express, 
handle, or crystallize. For example, we have found that E. faecalis MurG is more 
difficult to solubiiize that E. coli MurG (requiring more detergent). It is believed that the 
problems are related to the acceptor binding domain having a stronger affinity for the 
bacterial membranes. To overcome this problem, one can attach the donor binding 
domain of E. faecalis to the E. coli. acceptor binding site and determine structure to see 
details of E. faecalis donor binding domain. 

According to the present invention, crystalline MurG can be used to determine 
the ability of a chemical compound to bind to a MurG protein in a manner predicted by a 
structure based drug design method of the present invention. .Preferably, a MurG crystal 
is soaked in a solution containing a chemical compound of the present invention. 
Binding of the chemical compound to the crystal is then determined by methods standard 
in the art. Thereby, the co-crystal of MurG and a compound of interest is determined. 

METHODS OF CRYSTALLIZATION 
The present invention includes a method for producing crystals of MurG proteins, 
comprising: combining MurG protein with a reservoir solution and inducing crystal 
formation to produce MurG crystals. Another embodiment of the present invention, a 
method for producing crystals of MurG protein comprises combining MurG protein with 
UDP-GlcNAc in a 1:3 ratio and with a reservoir solution and inducing crystal formation 

to produce MurG crystals. 

Preferably, crystals of MurG are formed using a solution containing a range of 
MurG protein from about 1 mg/ml to about 20 mg/ml, more preferably above 5 mg/ml, 
limited only by the solubility of the protein, which may vary depending on the specific 

amino acid sequence. 

A reservoir solution contains the buffer, the precipitant, and additives, if 
. necessary. A suitable reservoir buffer of the present invention comprises NaMES (2-[N- 
morphol,no]ethanesulfonic acid, sodium salt) buffer, NaHEPES (N-[2- 
h y d r o X yethyl]piperazine-N'-[2-ethanesulfonic acid, sodium salt) buffer, Tris 
(tris[hydroxymemyl]aminomethane) buffer, and any buffer which has the PKa between 
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5.5 and 8.0. A suitable NaMES buffer solution has a pH rangProm about 5.6-6.5. Most " 
preferably, the NaMES buffer has a pH of about 6.5. The precipitant comprises 
ammonium sulfate, saturated sodium and potassium tartrate and polyethylene glycol. A 
suitable concentration of ammonium sulfate can range from 0.8 M to 1.5 M. Most 
preferably, the ammonium sulfate concentration is about 0.96 M. A suitable additive 
comprises detergents like Triton X-100 and n-octyl-beta-glucoside. The concentration of 
Triton X-100 can range from 0.1% to 1%. Most preferably, the concentration of Triton 
X-100 is 0.4%. 

In a preferred embodiment, MurG crystals are produced by a method comprising 
concentrating MurG protein in a buffer solution, mixing the protein concentrate with 
UDP-GlcNAc in a .1:3 molar ratio, mixing equal volumes of protein solution with a 
. reservoir solution, and inducing crystal formation to produce MurG crystals. 

In a particular embodiment of the invention, MurG crystals are produced by a 
method comprising concentrating MurG protein to 10 mg/ml in a buffer of 20 mM Tris- 
HC1, pH 7.9/150mM NaCl and 50 mM EDTA; mixing the protein concentrate with 
UDP-GlcNAc in a 1:3 molar ratio; mixing equal volumes of protein solution with a 
reservoir solution comprising (0.1 M NaMES, pH 6.5, 0.96 M (NH 4 ) 2 S0 4 , 0.4% 
TRITON® X-100, and 10 mM dithiolthreitol (DTT)), and inducing crystal formation 
using hanging drop vapor-diffusion.. This preferred method is described in greater detail 
in Example 1 . 

Supersaturated solutions of MurG protein can be induced to crystallize by several 
methods mcluding, but not limited to, vapor diffusion, liquid diffusion, batch 
crystallization, constant temperature and temperature induction or a combination thereof. 
Preferably, supersaturated solutions of MurG protein are induced to crystallize by vapor 
diffusion (i.e., hanging drop method). In a vapor diffusion method, a MurG protein 
solution is combined with a reservoir solution of the present invention that will cause the 
MurG protein solution to become supersaturated and form MurG crystals at a constant 
temperature. Vapor diffusion is preferably performed under a controlled temperature in 
the range of from about 15°C to about 30°C, more preferably from about 20°C to about 
25°C, and most preferably at a constant temperature of about 22°C. 

In another preferred embodiment, the present invention includes a method to 
produce crystals of MurG protein comprising the steps of: (a) preparing an about 10 
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mg/ml solutionof MUrG protein in a Tris-HCl buffer, (b^ixing UDP-GlcNAc with the " 
MurG protein solution in a 3:1 molar ratio, (g) dropping 2 ul droplet of this protein 
sample onto a coverslip, (d) adding an equal volume of reservoir solution to this droplet 
and inverting this over a well containing about I ml of the reservoir solution; and (e) 
incubating until crystals of MurG form. 

Any isolated MurG protein can be used with the present method. An isolated 
MurG protein can be isolated from its natural milieu or produced using recombinant 
DNA technology (e.g., polymerase chain reaction (PCR) amplification, cloning) or 
chemical synthesis. To produce recombinant MurG protein, a nucleic acid molecule 
encoding a MurG protein can be inserted into any vector capable of expressing the 
nucleic acid in a host cell. Suitable and preferred nucleic acid molecules to include in 
recombinant vectors of the present invention are as disclosed herein. Such suitable and 
preferred nucleic acid molecules include numerous MurG encoding genes that have been 
isolated to date, and that will be isolated in the future. A preferred nucleic acid molecule 
of the present invention encodes a homologue of MurG. Homologues of MurG can be 
recognized by the presence of certain conserved amino acid residues or sequences. 

A sequence alignment for six MurG sequences is shown in fig. 3 A. Highlighted 
' residues include those that are invariant or almost invariant across all MurG proteins. A 
nucleic acid molecule of the present invention can encode any portion of a MurG protein, 
preferably a full-length MurG protein or either of the two domains. A more preferred 
nucleic acid molecule to- include in a . recombinant vector, and particularly in a 
recombinant molecule, includes a nucleic acid molecule encoding a protein having the 
amino acid sequence represented by amino acid sequences of MurG proteins as deposited 
in the NCBI database and are identified with Accession Nos. CAB51993, A71316, 
E70579, C71699, F70195, A43727, JC1275, BVECMG, C EEC AM, 083535, Q9ZK59, 
' ' CAB85280, AAP39020, BAA18775, AAD26629, CAB73295, P37585, Q9ZHA9, 
Q9ZHDC0, Q9ZBA5, Q9X4H4, Q9WY74, P74657, 006224, Q9Z702, 084766, 
• D69552, )67238, O51708, 025770, 007670, O07109, P45065, CAB66324, AAC68356, 
AAF06830, P18579, P17443, P17952, . P16457, P07862, AAE23178, AAD53936, 
CAA18668, CAA38869, CAA38868, CAA38867, CAA38866, AAD08196, BAA01453, 
' BAA01455, BAA01454, AAD19042, CAA45558,.CAA74235, AAD10537, AAD06652, 
AAC95450, CAA14869, AAC73201, AAC65509, AAC67113, AAC45636, CAB08640, 
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AAC22793, AAC07193, BAA24357, CAB13395, BAA01355, AaH5538, 1904153C, 
1808265B, 1808265A, CAA36866, CAA36869, CAA36868, CAA36867, CAA36776, 
and AAA99436. Further, examples of nucleic acid molecules encoding MurG proteins 
have been deposited in NCBI, Genbank, and have Accession Nos. AL162758, " 
AE002281, D90917, AF1 10367, AL 139077, AJ242646, AE000520, AE00051 1, L42023, 
U00096, NC-000922, AE000783, AE000657, AE001348, AF099188, AR048673, 
AR048672, AF179611, AL022602, AL109663, X55034, AE000621, D10602, 
AE001670, X64259, Y13922, U10879, AE001535, AF068902, AJ235271, AEOO0U8, 
AE001227, AE001176, U94707, Z95388, U32793, AE000727, D84504, Z99111, 
D10483.X52644, X52540, and L24773. These sequences are known and are publicly 
available. Further, as additional genomes and genes are sequenced, more MurG 
encoding nucleotide sequences will become available, and can be used in the present 
invention. 

In specific embodiments of the invention, the protein sequence of E. coli MurG 
was reported in 1990 (Ikeda et al. Nucleic Acids Res. 1990, 19:4014; and Mengin- 
Lecreuix, D. et al., Nucleic Acids Res. 1990, 18:2810.). E. coli genomic DNA can be 
purified from E. coli or purchased from ATCC, or the gene for E. coli MurG is cloned 
into a plasmid can be obtained from numerous sources. Primers were designed to the 
portions of the gene corresponding to the N and C termini of the protein. The primers 
also encoded restriction enzyme sites outside the protein coding region. The gene 
sequence was amplified; the corresponding double stranded nucleic acid molecule was 
cut with appropriate restriction enzymes for cloning into a commercially available 
expression vector (pET expression vectors available from Novagen provide for numerous 
variations of MurG protein - wild-type or fusion proteins or proteins with affinity tags at 
N or C terminus. We have worked with several constructs but found that MurG with a 
His-tag at C-terminus crystallized best; the protein sequence contained an extra 
methionine at N-terminus and eight extra residues at C terminus, six of which were 
histidines. The vector used was pET21b. (as described in Ha et al. J. Am. Chem. Soc. 
121,(1 999) 841 5-8426 hereby incorporated by reference in its entirety). 

A recombinant vector of the present invention can be either RNA (probably not) 
or DNA, and typically includes, but is not limited to, a virus or plasmid. Any 
recombinant vector and host cell that provides for expression of a MurG protein 
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encoding mucleic acid sequence can be used in the present invention to express MurG 
protein for crystallization. Preferred vectors are engineered for high level expression in 
E. coli such as, but not limited to, pET vectors. We have found that over-expression of 
Murg from either E. coli or E. faecalis in E. coli cells is not toxic and, thus, this approach 
will work for other MurG proteins. 

As used herein, an expression vector is a DNA vector that is capable of 
transforming a host cell and of affecting expression of a specified nucleic acid molecule. 
Expression vectors of the present invention include any vectors that funct.on (i.e., direct 
gene expression) in recombinant cells of the present invention, including bacterial, 
fungal, and other microorganisms cells. Preferred expression vectors of the present 
invention direct expression in bacterial cells from a plasmid. A preferred recombinant 
molecule of the present invention comprises pET21b with E. coli MurG gene cloned into 

theNdel andXhol sites. 

An expression vector of the- present invention can be transformed into any 
suitable host cell to form a recombinant cell. A suitable host cell includes any cell 
capable of expressing a nucleic acid molecule inserted into the expression vector. For 
example, a procaryotic expression vector can be transformed into a bacterial host cell. If 

the expression vector contains a T7 -promoter then a source of T7 RNA polymerase must 
be provided to induce expression. Some host cells contain the T7 RNA polymerase gene 
in a repressed state. Expression of T7 RNA polymerase can be induced wUh a chemical 
signal such as IPTG or heat. Alternatively, a source of T7 RNA polymerase can be 
introduced at the appropriate- time by infection with a phage containing a copy of T7 
RNA polymerase. A wide range of hosts strains can be infected with a suitable phage. 
Some host strains have been engineered to contain inducible copies of T7 RNA 
polymerase gene. Such host strains include BL21.(DE3) and derivatives thereof. A 
preferred host strain of the present invention is BL21.(DE3)pLysS or BL21(DE3) P LysE, 
' which are commercially available from Novagen and can be readily transformed with a 
DNA plasmid vector containing a MurG gene under the control of the T7 promoter. As 
already stated above, ■ a preferred vector is a pET vector, preferably containing a 
' . restriction enzyme site permitting cloning of the gene as a fusion containing a C-term.nal 
his tag. •'*■■• • ; 
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eferred embodiment, one method to isolate^u 



In a preferred embodiment, one method to isolate^TurG protein useful for 
producing MurG crystals includes recovery of MurG protein having a C-terminal 
LEHHHHHH (His tag) sequence purified as described in Ha et al. (1999, J. Amer. 
Chem. Soc. 121:8415-8426). One of skill in the art is able to modify this procedure in 
order to purify other proteins can be produced as C-terminal histadine (his) tags. The 
purification conditions for specific MurG proteins will vary depending upon the 
particular characteristics of the proteins such as their isoelectric point, molecular weight, 
etc. It is known that the isoelectric points of different Murg homologues vary a bit, 
although they are generally relatively high. Also, some Murg homologues may be more 
hydrophobic than others, which will mean differences in amount of detergent necessary 
for purification. It is likely that all the Murg homologues can be purified over nickel 
affinity columns using the C-terminal his-tag as a handle. Those skilled in the art of 
protein purification will know how to modify purification parameters depending upon the 
protein characteristics, in order to purify the protein for crystallization. 

STRUCTURE OF MURG PROTErN 
One embodiment of the present invention includes a model of a MurG protein, in 
which the model represents a three dimensional structure of a MurG protein. Another 
embodiment of the present invention includes the three dimensional structure of a MurG 
protein. A three dimensional structure of a MurG protein encompassed by the present 
invention substantially conforms with the atomic coordinates represented in Table 1. 
According to the present invention, the use of the term "substantially conforms" refers to 
at least a portion of a three dimensional structure of a MurG protein which is sufficiently 
spatially similar to at least a portion of a specified three-dimensional configuration of a 
particular set of atomic coordinates (e.g., those represented by Table 1) to allow the three 
dimensional structure of another MurG protein to be modeled or calculated using the 
particular set of atomic coordinates defining the three dimensional configuration of the 
MurG protein. For example, but not meant to be a limitation, homology modeling can be 
done using the linear sequence of a different MurG and E. coli coordinates; molecular 
replacement can allow the solution of a different MurG structure using the E. coli MurG 
coordinates and experimental data such as x-ray diffraction pattern from a different 
MurG crystal. According to the present invention, a three dimensional structure of a 
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given portion or chain of a first MurG protein can subsfantially conform to at least a 
portion of the atomic coordinates which represent a three dimensional configuration of a 
second MurG. 

More particularly, a structure that substantially conforms to a given set of atomic 
coordinates is a structure wherein at least about 50% of such structure has an average 
root-mean-square deviation (RMSD) of less.than about 2.5 A for the a-carbon or C-alpha 
backbone atoms in secondary structure elements in each domain, and more preferably, 
less than about 2.0 A for the C-alpha backbone atoms in secondary structure elements in 
each domain, and, in increasing preference, less than about 1.5 A, less than about 1.0 A, 
less than about 0.7 A, and more preferably, less than about 0.5 A for the C-alpha 
backbone atoms in secondary structure elements in each domain. In a more preferred 
embodiment, a structure that substantially conforms to a given set of atomic coordinates 
is a structure wherein at least about 75% of such structure has the recited average root- 
mean-square deviation (RMSD) value, and more preferably, at least about 90% of such 
structure has the recited average RMSD value, and most preferably, about 100% of such 
structure has the recited average RMSD value. 

In an even more preferred embodiment, the above definition of "substantially 
' conforms" can be extended to include atoms of amino acid side chains. As used herein, 
the phrase "common amino acid side chains" refers to amino acid side chains that are 
common to both the structure which substantially conforms to a given set of atomic 
coordinates and the structure that is actually represented by such atomic coordinates. 
Preferably, a three dimensional structure that substantially conforms to a given set of 
atomic coordinates is a structure wherein at least about 50% of the common amino acid 
side chains have an average RMSD value of less than about 1.5 A, and more preferably, 
less than about 13 A, and in increasing preference, less than about 1.0 A, less' than about 
0.7 A, and most preferably, less than about 0.3 A. 

In a more preferred embodiment, a structure that substantially conforms to a 
g,ven set of atomic coordinates is a structure wherein at least about 75% of the common 
amino acid side chains have the recited average RMSD value, and more preferably, at 
least about 90% of the common amino acid side chains have the recited average RMSD 
value, and most preferably, about 100% of the common amino acid side chains have the 
recited average RMSD value. 
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In more preferred embodiments of the present inv^Kin, a large number of 
different "rotamers" or "rotational isomers" of the MurG protein are encompassed by 
three dimensional structures of the invention in which the amino acid side chains are at a 
variety of positions in crystalline forms of the protein or for the protein in solution. 
Different rotamers refer to molecules of identical configuration may be distinguished as 
having different conformations after rotation about the various molecular bonds. 
Therefore, while the same or similar amino acids may be present, the exact location will 
vary depending upon the freedom of rotation of the bonds due to hydrogen bonding, and 
other molecular forces. 

STRUCTURE OF THE a-CARBON BACKBONE OF MURG AND THE a- 
CARBON BACKBONE AND CONSERVED AMINO ACID RESIDUES 
The present invention includes the three dimensional structure of the a-carbon or 
C-alpha backbone of a MurG protein, in particular the E. coli MurG protein. A three 
dimensional structure of the C-alpha backbone of the MurG protein encompassed by the 
present invention substantially conforms with the atomic coordinates represented in 
Table 2. 

More particularly, a structure that substantially conforms to a given set of atomic 
coordinates is a structure wherein at least about 50% of such structure has an average 
root-mean-square deviation (RMSD) of less than about 2.5 A for the C-alpha backbone 
atoms in secondary structure elements in each domain, and more preferably, less than 
about 2.0 A for the C-alpha backbone atoms in secondary structure elements in each 
domain, and, in increasing preference, less than about 1 .5 A, less than about 1 .0 A, less 
than about 0.7 A, and more preferably, less than about 0.5 A for the C-alpha backbone 
atoms in secondary structure elements in each domain. In a more preferred embodiment, 
a structure that substantially conforms to a given set of atomic coordinates is a structure 
wherein at least about 75% of such structure has the recited average root-mean-square 
deviation (RMSD) value, and more preferably, at least about 90% of such structure has 
the recited average RMSD value, and most preferably, about 100% of such structure has 
the recited average RMSD value. The C-alpha backbone of MurG proteins is expected to 
be more conserved than the location of the particular amino acid residue side chains. 
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The present invention also includes the three dimensional structure of the ct- 
carbon or C-alpha backbone and conserved or invariant amino acid residue side chains of 
a MurG protein, in particular the E. coli MurG protein. A three dimensional structure of 
the C-alpha backbone and conserved amino acid residues of the MurG protein 
encompassed by the present invention substantially conforms with the atomic 
coordinates represented in Table 3. The conserved amino acids are highlighted in blue 
in Figure 3a and include G14, G15, G18, H19, G104, H.124, E125, G190, G191, SI 92, 
G194, A195, R261, G263, A264, E269, P281, Q289, N292 and A293 (as numbered in 
the E. coli MurG sequence set forth in Figure 3a). 

More particularly, a structure that substantially conforms to a given set of atomic 
coordinates is a structure wherein at least about 50% of such structure has an average 
root-mean-square deviation (RMSD) of less than about 2.5 A for the C-alpha backbone 
and conserved amino acid residue atoms in secondary structure elements in each domain, 
and more preferably, less than about 2.0 A for the backbone atoms in secondary structure 
elements in each domain, and, in increasing preference, less than about 1.5 A, less than 
about 1.0 A, less than about 0.7 A, and more preferably, less than about 0.5 A for the 
backbone atoms in secondary structure elements in each domain. In a more preferred 
embodiment, a structure that substantially conforms to a given set of atomic coordinates 
is a structure wherein at least about 75% of such structure has the recited average root- 
mean-square deviation (RMSD) value, and.more preferably, at least about 90% of such 
' structure has the recited average RMSD value, and most preferably, about 100% of such 
structure has the recited average RMSD value. 

STRUCTURE OF THE DONOR NUCLEOTIDE BrNDlNG SITE OF 

MURG PROTEINS ^ 
An embodiment of the present invention includes the three dimensional structure 
•' of a donor nucleotide binding site of a MurG protein, in particular an E. coli MurG 
' protein. A more preferred embodiment of the present invention includes a three 
dimensional structure of a donor nucleotide binding site of a MurG protein wherein the 
three dimensional structure of the donor nucleotide binding site substantially conforms to 
: the atomic coordinates in Table 4: In a preferred embodiment, the donor nucleotide 
binding site is a UDP-GlcNAc binding site of a MurG protein. 
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As described in Example 1, the donor nucleotide binding site is located in the C- 
terminal domain (see Fig. 4a). This binding site is based on the comparison of P- 
glucosyltransferase (BGT) and E. coli MurG and based on experiments done in our 
laboratory showing that the isolated C domain binds to a UDP-hexose column (See 
Example 1). The atomic coordinates of Table 4 set forth the donor nucleotide binding 
site three dimensional structure without a donor nucleotide such as LfDP-GlcNAc bound 
to the MurG protein. 

According to the present invention, the use of the term "substantially conforms" 
refers to at least a portion of a three dimensional structure of a donor nucleotide binding 
site of a MurG protein which is sufficiently spatially similar to at least a portion of a 
specified three-dimensional configuration of a particular set of atomic coordinates (e.g., 
those represented by Table 4) to allow the three dimensional structure of the donor 
nucleotide binding domain to be modeled or calculated (i.e., by molecular replacement) 
using the particular set of atomic coordinates defining the three dimensional 
configuration of the donor nucleotide binding site of a MurG protein. According to the 
present invention, a three dimensional structure of a given donor nucleotide binding site 
of a first MurG protein can substantially conform to at least a portion of the atomic 
coordinates which represent a three dimensional configuration of a second MurG. Since 
the atomic coordinates of Table 4 were obtained from the E. coli MurG crystal protein 
without a donor nucleotide bound, there will be some variation from the atomic 
coordinates of the donor nucleotide binding site when a nucleotide is bound vs. unbound. 
Therefore, a structure "substantially conforming" to that represented by the atomic 
coordinates in Table 4, will include a structure obtained from co-crytallization of the 
protein with a donor nucleotide. 

More particularly, a structure that substantially conforms to a given set of atomic 
coordinates is a structure wherein at least about 50% of such structure has an average 
root-mean-square deviation (RMSD) of less than about 1.5 A for the C-alpha backbone 
atoms in secondary structure elements in each domain, and more preferably, less than 
about 1.3 A for the C-alpha backbone atoms in secondary structure elements in each 
domain, and, in increasing preference, less than about 1.0 A, less than about 0.7 A, and 
more preferably less than about 0.5 A for the C-aipha backbone atoms in secondary 
structure elements in each domain. In a more preferred embodiment, a structure that 
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substantially conforms to a given set of atomic coordinates is a structure wherein at least 
about- 75% of such structure has the recited average root-mean-square deviation (RMSD) 
value, and more preferably, at least about 90% of such structure has the recited average 
RMSD value. ...... 

- In an even more preferred embodiment, the above definition of "substantially 
conforms" can be extended to include atoms of the. conserved or invariant amino acid 
side chains located within the binding site. As used herein, the phrase "conserved amino 
acid side chains" refers to amino acid side chains that are conserved between MurG 
proteins within the donor nucleotide binding site. The conserved amino acid residues of 
the donor nucleotide binding site have been identified as 1125, R261, G263, A264, E269, 
P281, Q289, N292 and A293 (as numbered in the E. coli MurG sequence set forth in 
Figure 3a) and the G loop found between residues numbered 190-195 having residues 
G190, G191, SI 92, G194, and A 195. Some or all of these conserved residues are 
necessary for binding the nucleotide donor. 

Preferably, a three dimensional structure that substantially conforms to a given 
set of atomic coordinates is a structure wherein at least about 50% of the conserved 
amino acid side .chains have an average RMSD value of less than about 1.5 A, and more 
preferably, less than about 1.3 A, and in increasing preference, less than about 1.0 A, less 
than about 0.7 A, and most preferably, less than about 0.3 A, In a more preferred 
embodiment, a structure that substantially conforms to a given set of atomic coordinates 
is a structure wherein at least about 75% of the conserved amino acid side chains have 
the recited average RMSD value, and more preferably, at least about 90% of the 
conserved amino - acid side chains have , the recited average RMSD value, and most 
preferably, about 100% of the conserved amino acid side chains have the recited average 
RMSD value. 

STRUCTURE OF- THE ACCEPTOR BINDING SITE OF MURG PROTEIN 
An embodiment of the present invention includes the three dimensional structure of an 
acceptor binding site of a MurG protein. . A three dimensional structure of a acceptor 
binding site of a MurG protein encompassed by the present invention substantially 
conforms with the . atomic coordinates represented , in Table 5. A more preferred 
embodiment of the present .invention includes a .three dimensional structure of an 
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acceptor binding site of a MurG protein wherein the three dimensional structure of the 
acceptor binding site substantially conforms to the atomic coordinates Table 5. 

According to the present invention, the use of the term "acceptors" refers to 
Lipid I and analogues thereof. For the purposes of obtaining co-crystals containing 
acceptor analogues bound to the acceptor binding site better, the analogues need not be 
functional acceptors in a MurG assay. In particular embodiments of the present 
invention, the acceptor is selected from the group consisting of, but not limited to Lipid I, 
and analogs of Lipid I (see compounds described in Ha et al., J. Amer. Chem. Soc. 1999, 
vol. 121 :8415-26, incorporated by herein by reference in its entirety). 

As described in Example 1, the acceptor binding site is located in the N- 
terminal domain of a MurG protein (see Fig. 3a and 4c). The acceptor binding site or 
domain is characterized by three highly conserved regions, two of which are glycine-rich 
loops (also referred to as "G loops*') that face the cleft between the C-terminal and N- 
terminal domains. The conserved residues of the acceptor binding site comprise G14, 
G15, G18, H19, G104, HI 24, and E125 (as numbered in the E. coli MurG sequence set 
forth in Figure 3a) and two conserved G loop structures. 

According to the present invention, the use of the term ''substantially conforms" 
refers to at least a portion of a three dimensional structure of an acceptor binding site of a 
MurG protein which is sufficiently spatially similar to at least a portion of a specified 
three-dimensional configuration of a particular set of atomic coordinates (e.g., those 
represented by Table 5) to allow the three dimensional structure of the acceptor binding 
site to be modeled or calculated (i.e., by homology modeling) using the particular set of 
atomic coordinates defining the three dimensional configuration of the acceptor binding 
site of a MurG protein. According to the present invention, a three dimensional structure 
of a given acceptor binding site of a first MurG protein can substantially conform to at 
least a portion of the atomic coordinates which represent a three dimensional 
configuration of a second MurG. 

In an even more preferred embodiment, the above definition of "substantially 
conforms" can be extended to include atoms of the conserved amino acid side chains. As 
used herein, the phrase "conserved amino acid side chains" refers to the conserved or 
invariant amino acid side chains that are common to MurG proteins. Preferably, a three 
dimensional structure that substantially conforms to a given set of atomic coordinates is a 
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structure whSn at least about 50% of the conserved OTino acid side chains have an 
average RMSD value of less than about 1.5 A, and more preferably, less than about 1.3 
A, and in increasing preference, less than about 1.0 A, less than about 0.7 A, and most 
preferably, less than about 0.3 A. In a more preferred embodiment, a structure that 
substantially conforms to a given set of atomic coordinates is a structure wherein at least 
about 75% of the conserved amino acid side chains have the recited average RMSD 
value, and more preferably, at least about 90% of the conserved amino acid side chains 
have the recited average RMSD value, and most preferably, about 100% of the conserved 
amino acid side chains have the recited average RMSD value. 



STRUCTURE OF A MEMBRANE ASSOCIATION SITE OF MurG PROTEIN 
An embodiment of the present invention includes the three dimensional structure of a 
membrane association site of a MurG protein. A. three dimensional structure of a 
membrane association site of a MurG protein encompassed by the present invention 
substantially conforms with the atomic coordinates represented in Table 6. A more 
preferred embodiment, of the present invention includes a three dimensional structure of 
an acceptor binding site of a MurG protein wherein the three dimensional structure of the 
• acceptor binding site substantially conforms to the atomic coordinates in Table 6. 

According to the present invention, the use of the term "membrane association 
site" refers to the region of a MurG protein that associates with cytoplasmic surface of 
bacterial membranes where it performs the reaction of coupling a soluble donor sugar to 
the membrane anchored acceptor sugar, Lipid I. Analysis of the E. coli MurG protein 
structure shows a hydrophobic patch consisting of residues 175, L79, F82, W85, and 
.VV116 in the N-domain. The membrane association site is where the MurG protein 
associates with the bacterial membranes, and that it is target for inhibitors if we find that 
a) we can bind to it with another molecule; b) we can disrupt membrane association by 
binding to it; or c) disrupting membrane association inhibits activity. 

As described in Example I, the membrane association site is located in the N- 
■ terminal domain of a MurG- protein (see Fig. 4c). The location of the membrane 
" association site, is in close proximity to the acceptor binding site and membrane 



29 



WO 01/90301 PCT/US01/U500 

association in this patch would bring the two M-terminal G-!oops close to the membrane 
surface where the diphosphate portion of the acceptor is located. 

According to the present invention, the use of the term "substantially conforms" 
refers to at least a portion of a three dimensional structure of a membrane association site 
of a MurG protein which is sufficiently spatially similar to at least a portion of a 
specified three-dimensional configuration of a particular set of atomic coordinates (e.g., 
those represented by Table 6) to allow the three dimensional structure of the membrane 
association site to be modeled or calculated (i.e., by molecular replacement) using the 
particular set of atomic coordinates defining the three dimensional configuration of the 
membrane association site of a MurG protein. According to the present invention, a 
three dimensional structure of a given membrane association site of a first MurG protein 
can substantially conform to at least a portion of the atomic coordinates which represent 
a three dimensional configuration of a second MurG. 

More particularly, a structure that substantially conforms to a given set of atomic 
coordinates is a structure wherein at least about 50% of such structure has an average 
root-mean-square deviation (RMSD) of less than about 1.5 A for the structural elements 
in the site, and more preferably, less than about 1 .3 A for the structure elements in each 
site, and, in increasing preference, less than about 1 .0 A, less than about 0.7 A, less than 
about 0.5 A, and more preferably, less than about 0.3 A for the structural elements in 
each site. In a more preferred embodiment, a structure that substantially conforms to a 
given set of atomic coordinates is a structure wherein at least about 75% of such structure 
has the recited average root-mean-square deviation (RMSD) value, and more preferably, 
at least about 90% of such structure has the recited average RMSD value, and most 
preferably, about 100% of such structure has the recited average RMSD value. 

In an even more preferred embodiment, the above definition of ''substantially 
conforms" can be extended to include atoms of ct-carbon backbone and conserved amino 
acid side chains. As used herein, the phrase "conserved amino acid side chains" refers to 
amino acid side chains that are conserved between MurG proteins. Preferably, a three 
dimensional structure that substantially conforms to a given set of atomic coordinates is a 
structure wherein at least about 50% of the conserved a-carbon backbone and conserved 
amino acid side chains have an average RMSD value of less than about 1.5 A, and more 
preferably, less than about 1.3 A, and in increasing preference, less than about 1.0 A, less 
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than about 0.7 A, arid most preferably, less than about 0.3 A. In a more preferred 
embodiment, a structure that substantially conforms to a given set of atomic coordinates 
is a structure wherein at least about 75% of the a-carbon backbone and conserved amino 
acid side chains have the recited average RMSD value, and more preferably, at least 
about 90% of the a-carbon backbone and conserved acid side chains have the recited 
average RMSD value, and most preferably, about 100% of the a-carbon and conserved 
amino acid side chains have the recited average RMSD value. 

COMPUTER READABLE MEDIUM 
Another embodiment of the present invention relates to a computer-readable 
medium encoded with, a set three dimensional coordinates seleceted from the group 
consisting of the three dimensional coordinates represented in Tablel, the three 
dimensional coordinates represented in Table 2, the three dimensional coordinates 
represented in Table 3, the three dimensional coordinates represented in Table 4, the 
three dimensional coordinates represented in Table 5, or the three dimensional 
coordinates represented in Table 6, wherein using a graphical display software prqgram, 
the three dimensional coordinates create an electronic file that can be visualized on a 
computer capable of representing said electronic file as a three dimensional image. 
Preferably, the three dimensional image is of a MurG protein, the a-carbon backbone of 
MurG, the a-carbon backbone and conserved amino acid residue sidechains of MurG, the 
' donor nucleotide binding site of MurG, the acceptor binding site of MurG, or the 
membrane association site of MurG. 

Yet' another embodiment of the present invention relates to a computer-readable 
medium encoded with a set of three dimensional coordinates, of . a three dimensional 
structure which substantially conforms to the three dimensional coordinates* represented 
in Table 1, wherein using a graphical display software, program, the three dimensional 
coordinates create an electronic file that can be visualized on a computer capable of 
representing said electronic file as a three dimensional image. In other embodiments, the 
present invention relates to a computer-readable medium encoded with a set of three 
dimensional coordinates of a three dimensional structure which substantially conforms to 
' the three dimensional' coordinates represented in Table 2, Table 3, Table 4, Table 5 or 
Table 6, wherein using a graphical display software program, the three dimensional 
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coordinates create an electronic file that can be visualized™ a computer capable of 
representing said electronic file as a three dimensional image. Preferably, the three 
dimensional image is of a MurG protein, the a-carbon backbone of MurG, the a-carbon 
backbone and conserved amino acid residue sidechains of MurG, the donor nucleotide 
binding site of MurG, the acceptor binding site of MurG, or the membrane association 
site of MurG. 

IMAGES 

One embodiment of the present invention relates to a two dimensional image of 
an E. coli MurG protein including those illustrated in Figures 3-4. Most of these figures 
were drawn with the MOLSCRIPT program. Preferably, the two dimensional image is 
of a MurG protein, the a-carbon backbone of MurG, the a-carbon backbone and 
conserved amino acid residue sidechains of MurG, the donor nucleotide binding site of 
MurG, the acceptor binding site of MurG, or the membrane association site of MurG. 

Another embodiment of the present invention includes a three dimensional 
computer image of the three dimensional structure of a MurG protein, preferably the E. 
coli MurG protein. Suitable structures of which to produce three dimensional computer 
images are disclosed herein. Preferably, a computer image is created to a structure 
substantially conforming with the three dimensional coordinates represented in Table 1. 

Another embodiment of the present invention includes an image of an MurG 
protein that is generated when a set of three dimensional coordinates comprising the 
three dimensional coordinates represented in Table 1 are analyzed on a computer using a 
graphical display software program to create an electronic file of the image and 
visualizing the electronic file as a three dimensional image. Suitable structures to image 
are disclosed herein. Preferably, the three dimensional structures are of a MurG protein, 
the a-carbon backbone of MurG, the a-carbon backbone and conserved amino acid 
residue sidechains of MurG, the donor nucleotide binding site of MurG, the acceptor 
binding site of MurG, or the membrane association site of MurG. Most preferably, the 
MurG protein is the E. coli MurG protein described herein. A computer image of the 
present invention can be produced using any suitable software program, including, but 
not limited to, MOLSCRIPT 2.0 (Avatar Software AB, Helenebrgsgatan 21C, SE-l 1713, 
Stockholm, Sweden), the graphical display program O (Jones et al., Acta 
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CrystallograP^ vol. A47, p. 110, 1991), or the grap^R display program GRASP. 
Suitable computer hardware useful for producing an image of the present invention are 
known to those of skill in the art. Preferred computer hardware includes a Silicon 
Graphics Workstation. 

MODELS OF MURG PROTEINS AND BINDING SITES 
According to the present invention, a three dimensional structure of the E. coli 
MurG protein and its binding sites of the present invention can be used to derive a model 
of the three dimensional structure of another MurG protein and its binding sites (i.e., a 
structure to be modeled).. As used herein, a "structure" of a protein refers to the 
components and the manner of arrangement of the components to constitute a protein or 
binding site. Also, as used herein, the term "model" refers to a representation of a 
tangible medium of the three dimensional structure of a protein, polypeptide or peptide, 
or binding site of a protein. For example, a model can be a representation of the three 
dimensional structure in a electronic file, on a computer screen, on a piece of paper (i.e., 
on a two dimensional medium), and/or as a ball-and-stick figure. Physical three- 
dimensional models are tangible and include, but are not limited to, stick models and 
space-filling models. The phrase "imaging the model on a computer screen" refers to the 
ability to express (or represent) and manipulate the model on a computer screen using 
appropriate computer hardware and software technology known to those skilled in the 
art. Such technology is available from a variety of sources including, for example, Evans 
and Sutherland, Salt Lake City, Utah, and Biosym Technologies, San Diego, CA. The 
phrase "providing a picture of the model" refers to the ability to generate a "hard copy" 
of the model. ■ Computer screen images and pictures of the model can be visualized in a 
number of formats including space-filling representations, a carbon traces, ribbon 
diagrams and electron density maps. 

Suitable target MurG proteins and their associated binding sites to model using a 
method of the present invention include any MurG protein and binding sites that are at 
least in part structurally related to the E. coli MurG protein or its binding sites. A 
preferred target MurG structure that is at least in part structurally related includes a target 
. MurG structure having an amino acid sequence that is at least about 25%, preferably at 
least about 30%, more preferably at least about 36%, more preferably at least about 40%, 
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even more preferrolye at least about 50%, more preferabl^^least about 60%, more 
preferably at least about 70%, more preferably at least about 80%, and more preferably at 
least about 90% identical to an amino acid sequence of the E. coli MurG protein, across 
the full-length of the target MurG structure sequence when using, for example, a 
sequence alignment program such as DNAsis™ program (available from Hitachi 
Software, San Bruno, CA) or the MacVector™ program (available from the Eastman 
Kodak Company, New Haven, CT) or the GCy™ program (available from the "GCy", 
University of Wisconsin, Madison, WT), such alignment being performed for example, 
using the standard default values accompanying such alignment programs. 

Preferred MurG proteins and their binding sites are set forth in the amino acid 
sequences of MurG proteins as deposited in the NCBI database and are identified with 
Accession Nos. CAB51993, A71316, E70579, C71699, F70195, A43727, JC1275, 
BVECMG, CEECAM, 083535, Q9ZK59, CAB85280, AAF39020, BAA18775, 
AAD26629, CAB73295, P37585, Q9ZHA9, Q9ZHDC0, Q9ZBA5, Q9X4H4, Q9WY74, 
P74657, 006224, Q9Z702, 084766, 069552, 067238, 051708, 025770, 007670, 
007109, P45065, CAB66324, AAC68356, AAF06830, P18579, P17443, P17952, 
P16457, P07862, AAE23178, AAD53936, CAA18668, CAA38869, CAA38868, 
CAA38867, CAA38866, AAD08196, BAA01453, BAA01455, BAA01454, AAD19042, 
CAA45558, CAA74235, AAD10537, AAD06652, AAC95450, CAA14869, AAC73201, 
AAC65509, AAC67113, AAC45636, CAB08640, AAC22793, AAC07193, BAA24357, 
CAB13395, BAA01355, AAB35538, 1904153C, 1808265B, 1808265A, CAA36866, 
CAA36869, CAA36868, CAA36867, CAA36776, and AAA99436. The amino acid 
sequences are publicly available. 

A variety of MurG proteins from numerous organisms can be used to prepare 
models of MurG proteins and binding sites, including but not limited to, microorganisms 
such as bacteria, higher-order bacteria, thermal stable bacteria, spirochetes, small 
pathogenic organisms, fungi, protozoa, cyanobacteria, and trypanosomes. More 
particularly, bacteria such as but not limited to, Escherichia coli, Bacillus subtilis, 
Aquefex aeolicus, Borrelia burgdorferi, Chlamydia pneumoniae, Chlamydia trachomatis, 
Enter -ococcus faecais, Enterococcus hirae, Haemophilus influenzae. Helicobacter pylori 
J99, Helicobacter pylori, Mycobacterium tuberculosis, Porphyromonas gingivalis, 
Rickettsia prowazekii, Streptomyces coelicolor, Streptomyces collinus. Streptococcus 
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pneumoniattynechocystis sp. (strain PCC6803), ThernXga maritime, and Treponema 
pallidum. It is noted that nucleotide and amino acid sequences for many of the above 
identified organisms are known and publicly available. 

Preferred target MurG proteins and binding site structures to model also include, 
but are not limited to, derivatives of MurG proteins, such as a MurG protein having one 
or more amino acid residues substituted, deleted or added (referred to herein as MurG 
mutants), or proteins encoded by natural variants of a nucleic acid molecule encoding a 
MurG. 

In another embodiment of the invention, the process of building a homology model 
for a protein is divided into the following steps: 

(1) Determine which proteins are related to the model protein; 

(2) Determine structurally conserved regions (SCRs); 

(3) . Align the amino acid sequence of the unknown protein with those of the reference 
protein(s) within the SCRs; - 

(4) Assign coordinates in the conserved regions; 

(5) . Predict conformations for the rest of the peptide chain, including loops between the 
SCRs and possibly the N- and C-termini; 

(6) Search for the optimum side chain conformations for residues that differ from those 
in the reference proteins; and 

(7) Use energy minimization and molecular dynamics to refine the molecular structure 
so that steric strain introduced during the model-building process can be relieved. 

Published sequences are readily available through on-line databases on the Internet, 
. such as SwissProt (http://www.expasy.ch/sprot/sprot-top.html). MurG specific and 
related sequences are obtained for use for building homology models by text-based or 
. ■ sequence. similarity searching. SCRs for MurG is the entire protein/considering the E, 
■ ■ coli MurG crystal structure is the only similar sequence with structural data. Alignment 
■ ■ of the sequences using an appropriate alignment program and algorithm, such as Clustal 
W, allows appropriate assignment of the E. coli. protein coordinates to a MurG sequence 
• of unknown structure. The Modeler program performs the conformational predictions 
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for the peptide chain and side chains. Dynamics and minimmftion using an appropriate* 
program and algorithm, such as Discover. 



Modeler Description: 

Modeler is an automated homology-modeling scheme designed to find the most 
probable three-dimensional structure of a protein, given its amino acid sequence and its 
alignment with related structures. It derives 3D protein models without the time 
consuming separate stages of core region identification and loop region building or 
searching that is inherent to manual homology modeling schemes. The related or 
reference protein structures are used to derive spatial restraints expressed as probability 
density functions (P.DFs) for each of the restrained features of the model. As an example, 
the main chain conformation of a given residue in the model will be described by 
restraints that depend upon the residue type, the main chain conformation of equivalent 
residues in the reference proteins and the local sequence similarity. The probability 
distribution functions that are used in restraining the model structure are derived from 
correlations between structural features in a database of families of homologous proteins 
aligned on the basis of their 3D structure. These functions are used to restrain C-C 
distances, main chain N~ O distances, main chain and side chain dihedral angles, etc. The 
individual restraints are assembled into a single molecular probability density function 
(MPDF). The three-dimensional protein model is then obtained by an optimization of 
this MPDF. The optimization procedure itself consists of a variable target function 
method (Braun and Go, 1985) with conjugate gradient minimization scheme followed by 
an optional restrained simulated annealing molecular dynamics scheme. 
While several reference structures are used in the traditional homology model building 
process, only one set of coordinates can be used in any one peptide segment. Modeler is 
able to simultaneously incorporate structural data from one or more reference proteins. 
Structural features in the reference proteins are used to derive spatial restraints which in 
turn are used to generate model protein structures using conjugate gradient and simulated 
annealing optimization procedures. 

Clustal W description: 
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Clus^T W aligns multiple sequences using a ^^Wgressive pairwise alignment- 
algorithm. It first generates all possible pairwise alignments for a list of sequences and 
then builds the guide tree based on their pairwise sequence identity, aligning the 
sequences following the order of the guide tree. 

Several unique features in Ciustal W improve the sensitivity of the alignment of 
divergent protein sequences (Thompson et al, 1994a). 

(1) Individual weights are assigned to each sequence in a partial alignment in order to 
downweight near-duplicate sequences and upweight the most divergent ones. 

(2) Amino acid substitution matrices are varied at different alignment stages according 
to the divergence of the sequences to be aligned. 

(3) Residue-specific gap penalties and locally reduced gap penalties in hydrophilic 
regions encourage new gaps in potential loop regions rather than regular secondary 
structure. 

(4) Positions in early alignments, where gaps have been opened, receive locally reduced 
gap penalties to encourage the opening of new gaps at these positions. 

Discover Description: 

' •' : " The- Discover program performs energy minimization, template forcing, torsion 
forcing, and dynamic trajectories and calculates properties such as interaction energies, 
derivatives, mean square displacements, and vibrational frequencies. It provides tools for 
performing simulations under various conditions including constant temperature, 
constant pressure, constant stress, periodic boundaries, and fixed and restrained atoms. 
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Homology modeling methods are known to those skilled in^rc art and are described in ' 
the following homology references: 

Bacon, D.J.; Anderson, W.F. "Multiple sequence alignment," J. Mol. Biol., 191, 153-161 
(1990). 

Barker, W. C; George, D. G.; Hunt, L. T. 'Trotein Sequence Database," in Methods in 
Enzymology, 183, 49, Academic Press:San Diego (1990). 

Barton, G.J. "Protein multiple sequence alignment and flexible pattern matching," Meth 
Enzymol, 183, 403-428 (1990). 

Benedetti, E.; Morelli, G.; Nemethy, G; Scheraga H. A. "Statistical and Energetic 
Analysis of Side-chain Conformations in Oligopeptides," Int. J. Peptide Protein Res. 22, 
1-15 (1983). 

Berger, M.P.; Munson, P.J. "A novel randomized iterative strategy for aligning multiple 
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STRUCTURE BASED DRUG DESIGN 
■ • The present invention relates to the use of the crystal structure of the E. coli 

■ MurG protein represented by the atomic coordinates in Table 1 to make models of MurG 
• proteins and binding sites thereof.. The .present invention also relates to the use of the 

crystal structure, a-carbon backbone, a-carbon backbone plus conserved amino acid 
residue side chains or binding sites of the E coli MurG protein to construct models of 
these structures in other MurG proteins. ... 

For the first time, the present invention permits the use of molecular design 

■ techniques to design, select and synthesize chemical entities and compounds, including 
inhibitory compounds, capable of binding to the active site or accessory binding site of 

MURG, in whole or in part. , . 

On approach enabled by this invention, is to use the structure coordinates of 
MURG to design compounds that bind to the enzyme and alter the physical properties of 
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the compounds in different ways, e.g., solubility. For example, this invention enables the 
design of compounds that act as inhibitors of the MURG enzyme by binding to, all or a 
portion of, the active site of MURG. 

A second design approach is to probe a MurG crystal with molecules composed 
of a variety of different chemical entities to determine optimal sites for interaction 
between candidate MURG inhibitors and the enzyme. For example, high resolution X- 
ray diffraction data collected from crystals saturated with solvent allows the 
determination of where each tpe of solvent molecule sticks. Small molecules that bind 
tightly to those sites can then be designed and synthesized and tested for their MURG 
inhibitor activity. Travis, J., Science, 262, p. 1374 (1993). 

This invention also enables the development of compounds that can isomerize to 
short-lived reaction intermediates in the chemical reaction of a substrate or other 
compound that binds to MURG, with MURG. Thus, the time-dependent analysis of 
structural changes in MURG during its interaction with other molecules is enabled. The 
reaction intermediates of MURG can also be deduced from the reaction product in co- 
complex with MURG. Such information is useful to design improved analogues of 
known MURG inhibitors or to design novel classes of inhibitors based on the reaction 
intermediates of the MURG enzyme and MURG-inhibitor co-complex. This provides a 
novel route for designing MURG inhibitors with both high specificity and stability. 

Another approach made possible and enabled by this invention, is to screen 
computationally small molecule data bases for chemical entities or compounds that can 
bind in whole, or in part, to the MURG enzyme. In this screening, the quality of fit of 
such entities or compounds to the binding site may be judged either by shape 
complementarity or by estimated interaction energy. Meng, E. C. et al., J. Coma. Chem., 
13, pp. 505-524 (1992). 

Because MURG may crystallize in more than one crystal form, the structure 
coordinates of MURG, or portions thereof, as provided by this invention are particularly 
useful to solve the structure of those other crystal forms of MURG. They may also be 
used to solve the structure of MURG mutants, MURG co-complexes, or of the crystalline 
form of any other protein with significant amino acid sequence homology to any 
functional domain of MURG. 
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One method that may be employed for this purpose is molecular replacement. In 
this method, the unknown crystal structure, whether it is another crystal form of MURG, 
a MurG mutant, or a MurG co-complex, or the crystal of some other protein with 
significant amino acid sequence homology to any functional domain of MURG, may be 
determined using the MURG structure coordinates of this invention as provided in 
Tables 1-6. This method will provide an accurate structural form for the unknown crystal 
more quickly and efficiently than attempting to determine such information ab initio. 

In addition, in accordance with this invention, MURG mutants may be 
crystallized in co-complex with known MURG inhibitors. The crystal structures of a 
series of such complexes may then be solved by molecular replacement and compared 
with that of wild-type MURG. Potential sites for modification within the various binding 
sites of the enzyme may thus be identified. This information provides an additional tool 
for determining the most efficient binding interactions, for example, increased 
hydrophobic interactions, between MURG and a chemical entity or compound. 

All of the complexes referred to above may be studied using well-known X-ray 
diffraction techniques and may be refined versus 2-3 .ANG. resolution X-ray date to an R 
value of about 0.20 or less using computer software, such as X-PLOR (Yale University, 
.COPYRGT. 1992,' distributed 'by Molecular Simulations, Inc.). See, e.g., Blundel & 
Johnson, supra; Methods in Enzvmoloav, vol. 114 & 115, H. W. Wyckoff et al., eds., 
Academic Press (1985). This information may thus be used to design, synthezie and 
optimize novel classes of MURG inhibitors. 

- - The ' structure coordinates of MURG mutants provided in this invention also 
facilitate the identification of related proteins or. enzymes analogous to MURG in 
function, structure or both, thereby further leading to novel therapeutic modes for treating 
or preventing UDP-glycosyltransferase mediated diseases. 

■• The design of compounds that bind to. or inhibit MURG according to this 
invention generally involves consideration of two factors. First, the compound must be . 
capable of physically and structurally associating with MURG. Non-covalent molecular 
interactions important in the association of MURG with its substrate include hydrogen 
bonding, van der Waals and hydrophobic interactions. • 

Second, the compound must be able to assume. a conformation that allows it to 
associate with MURG. Although certain portions of the compound will not directly 
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participate in this association with MURG, those portions may still influence the overall 
conformation of the molecule. This, in turn, may have a significant impact on potency. 
Such conformational requirements include the overall three-dimensional structure and 
orientation of the chemical entity or compound in relation to all or a portion of the 
binding site, e.g., active site or accessory binding site of MURG, or the spacing between 
functional groups of a compound comprising several chemical entities that directly 
interact with MURG. 

The potential inhibitory or binding effect of a chemical compound on MURG 
may be analyzed prior to its actual synthesis and testing by the use of . computer 
modelling techniques. If the theoretical structure of the given compound suggests 
insufficient interaction and association between it and MURG, synthesis and testing of 
the compound is obviated. However, if computer modelling indicates a strong 
interaction, the molecule may then be synthesized and tested for its ability to bind to 
MURG and inhibit using the assay of Walker et al. patents (cited supra). In this manner, 
synthesis of inoperative compounds may be avoided. 

An inhibitory or other binding compound of MURG may be computationally 
evaluated and designed by means of a series of steps in which chemical entities or 
fragments are screened and selected for their ability to associate with the individual 
binding pockets or other areas of MURG. 

One skilled in the art may use one of several methods to screen chemical entities 
or fragments for their ability to associate with MURG and more particularly with the 
individual binding pockets of the MURG donor nucleotide binding site, acceptor binding 
site or membrane association site. This process may begin by visual inspection of, for 
example, the binding sites on the computer screen based on the MURG coordinates in 
Tables 1-6. Selected fragments or chemical entities may then be positioned in a variety 
of orientations, or docked, within an individual binding pocket of MURG as defined 
supra. Docking may be accomplished using software such as Quanta and Sybyl, followed . 
by energy minimization and molecular dynamics with standard molecular mechanics 
forcefields, such as CHARMM and AMBER. 

Specialized computer programs may also assist in the process of selecting 
fragments or chemical entities, including but not limited to: 
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1. GRID (Goodford; P. J., "A Computational Procedur^or Determining Energetically' 
Favorable Binding Sites on Biologically Important Macromolecules" J. Med. Chem., 28, 
pp. 849-857 (1985)). GRID is available from Oxford University, Oxford, UK. 

2. MCSS (Miranker, A. and M. Karplus, "Functionality Maps of Binding Sites: A 
Multiple Copy Simultaneous Search Method." Proteins: Structure, Function and 
Genetics, 11, pp. 29-34 (1991)). MCSS is available from Molecular Simulations, 
Burlington, Mass. • ■ , 

3. AUTODOCK (Goodsell, D. S. and A. J. Olsen, "Automated Docking of Substrates to 
Proteins by Simulated Annealing" Proteins: Structure. Function, and Genetics, 8, pp. 
195-202 (1990)) (AUTODOCK is available from Scripps Research Institute, La Jolla, 
Calif.). 

4. DOCK (Kuntz, I. D. et al., "A Geometric Approach to Macromolecule-Ligand 
Interactions" J. Mol. Biol., 161, pp. 269-288 (1982)). DOCK is available from University 
of California, San Francisco, Calif. 

Once suitable chemical entities or fragments have been selected, they can be 
assembled into a single compound or inhibitor. Assembly may be proceed by visual 
inspection of the relationship of the fragments to each other on the three-dimensional 
image displayed on a computer screen in relation to the structure coordinates of MURG. 
■ This would be followed by manual model building using software such as Quanta or 

Sybyl. •' • • . ■• 

Useful programs to aid one of skill in the art in connecting the individual 

chemical entities or fragments include, but are not limited to: . 

'I. CAVEAT (Bartlett, P. A. et al, "CAVEAT:. A Program to Facilitate the Structure- 
Derived Design of Biologically Active Molecules". In Molecular Recognition in 
" Chemical and Biological Problems", Special Pub., Royal Chem. Soc, 78, pp. 182-196 
(1989)). CAVEAT is available from the University of California, Berkeley, Calif. 



49 



WO 01/90301 PCT/US01/11500 

2. 3D Database systems such as MACCS-3D (MDL Informlffion Systems, San Leandro;'* 
Calif.). This area is reviewed in Martin, Y. C, "3D Database Searching in Drug Design", 
J. Med. Chem., 35, pp. 2145-2154 (1992)). 

3. HOOK (available from Molecular Simulations, Burlington, Mass.). 

Instead of proceeding to build a MurG inhibitor in a step-wise fashion one 
fragment or chemical entity at a time as described above, inhibitory or other MURG 
binding compounds may be designed as a whole or "de novo" using either an empty 
active site or optionally including some portion(s) of a known inhibitor(s). These 
methods include, but are not limited to: 

1. LUDI (Bohm, H.-J., "The Computer Program LUDI: A New Method for the De Novo 
Design of Enzyme Inhibitors", J. ComR. Aid. Molec. Design, 6, pp. 61-78 (1992)). LUDI 
is available from Biosym Technologies, San Diego, Calif. 

2. LEGEND (Nishibata, Y. and A. Itai, Tetrahedron, 47, p. 8985 (1991)). LEGEND is 
available from Molecular Simulations, Burlington, Mass. 

3. LeapFrog (available from Tripos Associates, St. Louis, Mo.). 

Other molecular modeling techniques may also be employed in accordance with 
this invention. See, e.g., Cohen, N. C. et al., "Molecular Modeling Software and Methods 
for Medicinal Chemistry, J. Med. Chem., 33, pp. 883-894 (1990). See also, Navia, M. A. 
and M. A. Murcko, "The Use of Structural Information in Drug Desjgn", Current 
Opinions in Structural Biology, 2, pp. 202-210 (1992). 

Once a compound has been designed or selected by the above methods, the 
efficiency with which that compound may bind to MURG may be tested and optimized 
by computational evaluation. For example, a compound that has been designed or 
selected to function as a MurG-inhibitor must also preferably traverse a volume not 
overlapping that occupied by the active site when it is bound to the native substrate. An 
effective MURG inhibitor must preferably demonstrate a relatively small difference in 
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energy between its bound and free states (i.e., a small deformation energy of binding). 
Thus, the most efficient MURG inhibitors should preferably be designed with a 
deformation energy of binding of not greater than about 10 kcal/mole, preferably, not 
greater than 7 kcal/mole. MURG inhibitors may interact with the enzyme in more than 
one conformation that is similar in overall binding energy. In those cases, the 
deformation energy of binding is taken to be the difference between the energy of the 
free compound and the average energy of the conformations observed when the inhibitor 
binds to the enzyme. 

A compound designed or selected as binding to MURG may be further 
computationally optimized so that in its bound state it would preferably lack repulsive 
electrostatic interaction with the target enzyme. Such non-complementary (e.g., 
electrostatic) interactions include repulsive charge-charge, dipole-dipole and charge- 
dipole interactions. Specifically, the sum of all electrostatic interactions between the 
inhibitor and the enzyme when the inhibitor is bound to MURG, preferably make a 
neutral or favorable contribution to the enthalpy of binding. - 

Specific computer software is available in the art to evaluate compound 
'deformation energy and electrostatic interaction. Examples of programs designed for 
such uses include, but are not limited to: Gaussian 92, revision C [M..J. Frisch, Gaussian, 
Inc., Pittsburgh, Pa. .COPYRIGHT. 1992]; AMBER, version 4.0 [P. A. Kollman, 
University of California at San Francisco, .COPYRIGHT; 1994]; QUANT A/CHARMM 
[Molecular Simulations, Inc., Burlington, Mass. .COPYRIGHT. 1994]; and Insight 
U/Discover (Biosysm Technologies Inc. San biego, Calif. .COPYRIGHT. 1994). These 
" programs may be implemented, for instance, using a Silicon Graphics workstation, IRIS 
' Octane or IBM RISC/6000 workstation. Other hardware systems and software packages 
will be known to those skilled in the art. • 

Once a MurG-binding compound has been optimally selected or designed, as 
' ' described above, substitutions may then be made in some of its atoms or side groups in 
order to improve or modify, its binding properties. Generally, initial substitutions are 
conservative, i.e., the replacement group will have approximately the same size, shape, 
hydrophobicity and charge as the original group. It should, of course, be understood that 
components known in the art to alter conformation should be avoided. Such substituted 
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chemical compounds may then be analyzed for efficiency o^fft to MURG by the same 
computer methods described in detail, above. 



COMPOUNDS AND COMPOSITIONS COMPRISING COMPOUNDS 
DERIVED FROM STRUCTURE BASED DRUG DESIGN 

One embodiment of the present invention is a compound that is capable of 
binding to a MurG protein, inhibiting the activity of a MurG protein, or stimulating the 
activity of a MurG protein. Suitable inhibitory compounds of the present invention can: 
(1) inhibit (i.e., prevent or block) the activity of MurG enzyme by binding to a MurG 
donor nucleotide binding site and interfering with the binding of the donor nucleotide 
molecule; (2) inhibit the activity of MurG enzyme by binding to the MurG acceptor 
binding -site and interfering with the binding of the acceptor molecule; (3) inhibit the 
activity of a MurG enzyme by binding to the membrane association site and interfering 
with the association of the protein with the bacterial membrane and/or acceptor molecule. 

Another embodiment of the present invention is a compound that is capable of 
stimulating MurG activity. Suitable stimulatory compounds of the present invention can 
stimulate the activity of a MurG enzyme by binding to the protein at a binding site and 
causing an increase in enzymatic activity, for example, by increasing the enzymes 
affinity to bind a donor nucleotide, an acceptor molecule or improve the enzymes 
stability or increasing the binding affinity of a molecule to MurG. 

Such compounds that bind to, inhibit or stimulate activity of a MurG protein 
include, for example, compounds that mimic donor nucleotide molecules. In preferred 
embodiments, the compound includes, for example, pyrimidine nucleoside analogues. In 
yet another preferred embodiment, the compounds include compounds comprising a 
pyrimidine nucleoside with a substituent containing at least one heteroatom attached to 
the C5 hydroxyl. In more particular embodiments, pyrimidine derivatives make 
complementary hydrogen bonding contacts to the amide backbone segment containing 
He 245 and also contact glutamate 269. 

Another embodiment of the present invention is a compound that binds to the 
acceptor binding site of the MurG protein, hereinafter referred to a acceptor analogs. An 
acceptor analog refers to a compound that interacts with (e.g., binds to, associates with, 
modifies) the acceptor binding site of a MurG protein. An acceptor analog, for example, 
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is a compound that mimics the natural acceptor molecule, Cipid I. Examples of such 
acceptor analogs are set forth in Ha et al., J. Amer. Chem. Soc. 1999, and 
PCTAJS99/02187, U.S. Provisional Application No. 60/073,376 filed February 2, 1998, 
incorporated herein by reference. 

Another embodiment of the present invention is a compound that binds to the 
MurG protein, that are enzyme product analogs, hereinafter referred to as Lipid II 
analogs. A Lipid II analog refers to a compound that interacts with (i.e. , binds to, 
associates with, modifies) the acceptor binding site of a Mur G protein which mimics the 
product of the transglycosylase reaction. 

Inhibitory and stimulatory compounds of the present invention can be identified 
by various means known to those of skill in the. art. For example, binding of an 
inhibitory compound- to, or otherwise interaction with, a MurG protein, can be 
determined with MurG in solution, for example, using assays described in 
PCT/US99/02187, U.S. Provisional Application No. 60/073,376 filed February 2, 1998. 
and PCT/US00/05554 , U.S. Provisional Application Nos. 60/122,966 and 60/137,696, 
incorporated herein by reference. 

According to the present invention, suitable compounds of the present invention 
include peptides or other organic molecules, and inorganic molecules. Suitable organic 
molecules include small organic molecules. Preferably, a compound of the present 
invention is not harmful (i.e., toxic) to an animal when administered to an animal. 

Compounds of the present invention also can be identified using structure based 
drug design techniques known to those skilled in the art and described herein above. 

' Also according to the present invention, compounds; are suitable for use in the 
inhibition of bacterial or microbial growth in an animal, and for example, function as an 
antibiotic for treatment of bacterial infections in animals. 

• : The present invention also includes compositions comprising compounds of the 
present invention that inhibit or stimulate MurG activity which function as antibiotics or 
antimicrobial agents in animals. Compositions of the present invention can be used 
therapeutically or diagnostically in an animal. Compositions of the present invention 
comprises at least one compound of the present invention. In a preferred embodiment, 
compositions of the present invention further comprise . a carrier. More particularly, a 
suitable carrier is a pharmaceutical^ acceptable carrier known to those skilled in the art. 
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TABLE 1- ATOMIC COORDINATES OF E. COLI MURG PROTEIN 

REMARK coordinates from minimization re finement REMARK refinement 
resolution: 40.0 - 1.9 AREMARK starting r= 0.2200 free_r- 0.2466REMARK 
final r= 0.2200 free_r= 0.2466REMARK rmsd bonds= 0.005558 rmsd 

angles* 1.295 05REMARK wa= 1 . 08391REMARK target= mlf cycles= 1 steps- 
30REMARK sg= PI a= 60.613 b= 66.356 c= 67.902 alpha= 64.294 beta = 
83.520 gamma- 65.448REMARK parameter file 1 : 
CNSJTOPPAR: protein_rep . paramREMARK parameter file 2 : 
CNSJTOPPAR: water._r.ep. paramREMARK parameter file 3 : 

CNSJTOPPAR: ion . paramREMARK molecular structure file: gen . mtf REMARK 
input coordinates: gen . pdbREMARK reflection file= na t i ve . cv REMARK ncs= 
noneREMARK B-correction resolution: 6.0 - 1 . 9REMARK initial B-factor 
correction applied to fobs : REMARK Bll = 0.747 B22= 2.098 B33= - 
2.84 5REMARK B12« -1.847 313= -3.752 B23= 6.401 REMARK B-factor 
correction applied to coordinate array B: 0.038 REMARK bulk solvent: 

density level- 0.351665 e/A~3, B-factor- 4 3.8282 A~ 2 REMARK reflections 
wich ( Fobs | /$igma_F < 2.0 re j ectedREMARK reflections with I Fobs ! > 
10000 * rms(Fobs) re j ectedREMARK theoretical total number of ref 1 . in 
resol. range: 68102 (100.0%) REMARK number of unobserved reflections (no 
entry or |F|-0) : 2825 ( 4 . 1% ) REMARK number of reflections rejected: 
3288 (4.8 %) REMARK total number of reflections used: 
61989 91.0%) REMARK number of reflections in working set: 
55765 (81.9%) REMARK number of reflections in test set: 

6224 (9.1%)CRYST1 60.613 66.356 67.902 64.29 83.52 65.45 P 1 
REMARK F I LENAME= "minimi ze 5 . pdb " REMARK DATE : 1 4 -Jan- 00 15:25:36 
created by user: s ha REMARK VERSION: 
0.5 
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1976 O ILE A 270 10.076 3.194 3.075 1.00 28.96 

1977 N ALA A 27 1 8.818 4.568 4.340 1.00 28.51 

1978 CA ALA A 271 9.952 5.374 4.768 1.00 28.79 

1979 CB ALA A 27 1 9.462 6.576 5.572 1.00 28.12 

1980 C ALA A 271 10.918 4.530 5.603 1.00 29.26 

1981 O ALA A 271 12.136 4.575 5.394 1.00 29.35 

1982 N ALA A 272 10.370 3.755 6.534 1.00 28.79 

1983 CA ALA A 272 11.187 2.904 7.397 1.00 29.79 

1984 CB' ALA A 272 10.301 2.207 8.430 1.00 29.28 

1985 C ALA A 272 11.957 1.872 6.566 1.00 30.22 

1986 O ALA A 272 13.102 1.539 6.876 1.00 29.36 

1987 N ALA A 273 11.327 1.377 5.503 1.00 30.03 

1988 CA ALA A 273 11.961. 0.394 4.628 1.00 30.65 

1989 CB ALA A 273 10.914 -0.306 3.782 1.00 29.48 

1990 C ALA A 273 13.005 1.04 1 3.720 1.00 31.45 

1991 O ALA A 273 13.803 0.346 3.090 1.00 31.87 

1992 N GLY A 274 12.998 2.368 3.662 1.00 31.20 

1993 CA GLY A 274 13.937 3.078 2.814 1.00 32.26 

1994 C GLY A 274 13.725 2.633 1.362 1.00 32.80 

1995 O GLY A 274 14.652 2.226 0.692 1.00 33.38 

1996 N LEU A 275 1.2.501 2.862 0.873 1.00 32.88 

1997 CA LEU A 275 12.169 2.494 -0. 497 1.00 32.70 

1998 CB LEU A 275 11.266 1.262 -0.502 1.00 32.79 

1999 CG LEU A 275 11.969 -0.138 -0.4 31 1.00 33.70 

2000 CD1 LEU A 275 10.762 -1.133 -0.114 1.00 33.69 

2001 CD2 LEU. A 275 12.538 -0.484 -1.762 1.00 32.25 

2002 C LEU A 275 11.479 3.568 -1.324 1.00 33.39 

2003 O LEU A 275 10.638 4.320 -0.819 1.00 32.48 

2004 N PRO A 276 11.835 3.654 -2.617 1.00 32.76 

2005 CD PRO A 276 13.022 3.048 -3.244 1.00 32.59 
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BBBBATOM 2 835 CA GLY B 34 

BBBBATOM 2836 C GLY B 34 

BBBBATOM 2837- O .GLY B 34 

BBBBATOM. 2838 N ' TRP B 35 

BBBBATOM ■ 2839 CA TRP B 35 

BBBBATOM 2 84 0 CB * TRP B 35 

BBBBATOM. 2841 . CG TRP B 35 

BBBBATOM ' 2842 CD2 TRP B 35 

BBBBATOM 2843 CE2 TRP B 35 

BBBBATOM . 2844 . CE3 TRP B 35 

BBBBATOM 2845' CD1 TRP B 35 

BBBBATOM 284 6 NET TRP B 35 

BBBBATOM 2847 CZ2 TRP B 35 

BBBBATOM 2848 CZ3 TRP B 35 

BBBBATOM ; 2849 CH2 TRP B 35 

BBBBATOM 28 50 C : TRP B 35 

BBBBATOM -2851 O . TRP B 35 

BBBBATOM 2852 N GLN B 36 

BBBBATOM 2853 CA GLN B 36 

BBBBATOM -.2854 CB" GLN 8 36 

BBBBATOM '2855 CG GLN B 36 

BBBBATOM ' -2856. CD GLN B 36 

BBBBATOM 2857 OE1 GLN B 36 

BBBBATOM .2858 NE2 GLN B 36 

B3BBATOM 2 8 59 C ' GLN B 3 6 

BBBBATOM ■ 28 60 0- . GLN B 36 

BBBBATOM " 2861 N * VAL B 37 

BBBBATOM 2862 CA VAL B 37 

BBBBATOM . 2863 .CB VAL B 37 

BBBBATOM " 28 64 CGI VAL B 37 



PCT/USO 1/1 1500 

0, 436 -33^12 -40.583 1.00 37.53 
1.055 -33.155 -41.657 1.00 37.93 
1.207 -34.154 -42.508 1.00 37.24 
-2.567 -36.324 -41.068 1.00 33.00 
-2.845-35.674 -42.078 1.00 33.02 
-2.676 -37.650 -41.012 1.00 31.69 
-3.136 -38.417 -42.162 1.00 32.00 
-2.701 -39.883 -42.033 1.00 30.44 
-1.191.-40.099 -42.203 1.00 29.95 
-0.823 -41.550 -41.898 1.00 28.64 
-0.785 -39.736 -43.621 1.00 28.97 
-4.641 -38.332 -42.375 1.00 32.37 
-5.109-38.414 -43.507 1.00 33.16 
-5.401 -38.168 -41.297 1.00 33.50 
-6.849 -38.064 -41.424 .1.00 34.91 
-7.514 -38.061 -40.048 1.00 35.67 
-7.536 -39.424 -39.391 1.00 36.44 
-8.407 -39.436 -37.827 1.00 37.84 
-7.095 -39.039 -36.719 1.00 37.80 
-7.197 -36.786 -42.173 1.00 35.99 
-8 224 -36.710 -42.847 T. 00 37.36 
-6.326 -35.791 -42.052 1.00 36.39 
-6.510 -34.511 -42.722 1.00 37.55 
-5.622 -33.4.58 -42.073 1.00 36.85 
-6.151 -34.663 -44.205 1.00 38. 60 
-6.392 -33.760 -45.013 1.00 38.04 
-5.569 -35.809 -44.555 1.00 38.35 
-5 182 -36.070 -45.938 1.00 38.24 
-3.792 -36.706 -46.011 1.00 39.81 
-2.771 -36.162 -45.028 1.00 4 1.67 
-2.269 -34.781 -45.384. .1.00 41.96 
-1.735 -34.559 -46.471 1.00 41.35 
-2 423 -33.843 -44.456 ' 1.00 42.42 
-6.187 -37.027 -46.556 1.00 36.70 
-5.970 -37.541 -47.647' "1.00 36.47 
.7*281 -37.274 -45.845" 1.00 36.09 
-.8.305 -38.169 -46, 353- 1.00 35.75 
-8' 219' -39 . 600 -45.837 1.00 35.33 
-9*075 -40. 426 -46.146 -1.00 34.43 
-7.196 -39.896 -45.042 1.00 35.23 
-7..016 -41.246 -44.508 1.00 34.58 
-51618 -41.400 -43.908 "1.00 35.03 
-4.511 -41 . 558 -4.4.901 .1.00 34 . 87 
-3.666 -42-. 704 -45.060 1.00 35.81 
-2. 733 -42.399 -46.076/ 1.00 35.60 
-3.605 -43.961 -4 4'. 4 4 1 ' - 1 . 00 36.59 
-4.075 -40.633 -45.803 1.00 35.62 
-3.004 -41.129 -46.513; 1.00 34.95 
-1.746 -43.305 -46.488 1.00 37.19 
-2 620-44.865 -44.352 1.00 36.85 
-1.705 -44.528 -45.867 . 1\ 00 36.87 
-8 022' -41.670 -43.449. 1.00 34.58 
-8.546 -40.850 -42.699 ' 1.00 33.58 
-8 295 -42.969 -43.410 1.00 34.54 
-9.175 -43.535 -42.402" 1.00 35.40 
-10.095 -44.596 -43.003 \ 1.00 37.42 
-11.219 -44.027 -43.862" 1.00 41.66 
-11.996 -45.103 -44.601". 1.00 44.12 
-12.988 -44.816 -45.277 .1.00 45.76 
-11 546 -46.350 -44 /4 80'. 1.00 44.99 
-8.183 -44.186 -41.451' 1.00 34.89 
-7.272 -44;885 -41.889 1.00 34.53 
- -8.336 -43-. 941 "-40. 158-': 1.00 35.26 
-"--7*417 -44 /516 -39.184 1.00 34.16 
-6 561 -43. 4 17 -38': 509 1.00 34.99 
-5 630 -44.-032 -37.479 1.00 34.72 
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PCT/USO 1/11500 
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O 
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37 
37 
37 
38 
38 
38 
38 
38 
38 
38 
38 
38 
38 
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39 
39 . 
39 
39 
39 
39 
39 
39 
39 
39 
39 
39 
39 
39 
40 
40 
40 
40 
40 
40 
40 
40 
41 
41 
41 
41 
42 
42 
42 
42 
42 
42 
42 
4 3 
43 
43 
43 
43 
44 
44 
44 
44 
44 
44 
44 
44 
45 
45 
45 
45 
45 
45 



-5.755 - 
-8 . 166 - 
-9.207 ■ 
-7. 629 • 
-8.219 • 
-8.811 
-9.106 
-11. 107 
-12.000 
-13. 304 
-13.892 
-14 . 026 
-7 . 097 
-5. 936 
-7.447 
-6.456 
-6. 696 
-5 . 480 
-4 . 677 
-3. 625 
-4.749 
-4 .898 
-3. 783 
-2 . 648 
-3 .770 
-2 . 734 



-6 
-7 
-5 
-5 
-4 



478 
, 509 
. 323 
. 200 
. 326 



-4.416 
-3.037 
-5 .421 



-4 
-3 
-5 
-4 



535 
387 
260 
691 
-5 . 292 
-5 .797 
-5 .225 
-5 . 787 
-4 . 693 
-4 .019 
-3. 683 
-6.852 
-6.737 
-7 . 881 
-9 . 000 
-10 . 063 
-8 . 671 
-9.224 
-7 .777 
-7 .455 
-7.311 
-8 . 646 
-9.553 
-3 . 791 
-6. 236 
-5. 520 
-6.001 
-4 . 887 
-3.802 
-2.935 
. -2.218 
-1.385 



42 . 668 
45.288 
44.849 - 
46.451 - 
47 . 286 - 
•48.554 - 
■4 9.324 - 
-49.511 - 
-50.108 - 
-50.293 - 
-49.927 - 
-50.839 - 
-47.628 - 
-47 .742 - 
-47.782 - 
-48.070 - 
-47 . 145 ■ 
-46.788 • 
-45.627 
-45.673 
-44 . 549 
-47 . 476 
-46.812 
-44 . 684 
-43. 561 
-43. 639 
-49.517 
-50.176 
-50.004 
-51. 364 
-52.221 
-53.754 
-54 . 334 
-54 . 179 
-51.235 
-50.824 
-51.567 
-51 . 450 
-52 .479 
-53 . 498 
-52.225 
-53. 141 
-53.715 
-52 . 647 
-54 . 514 
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15.804 -18.740 -16.037 1.00 16.88 

15.971 -19.367 -17.087 1.00 15.75 

15.566 -19.326 -14.870 1.00 15.97 

15.508 -20.771 -14.741 1.00 16.47 

16.259 -21.245 -13.476 1.00 15.51 

16.316 -22.770 -13.448 1.00 17.66 

17.652 -20.626 -13.430 1.00 16.97 

14.076 -21.282 -14.644 1.00 16.41 

13.343 -20.920 -13.732 1.00 18.33 

13.695 -22.128 -15.588 1.00 15.92 

12.361 -22.710 -15.604 1.00 16.75 

11.813 -22.701 -17.035 1.00 16.85 

10.445 -23:340 -17.276 1.00 18.63 

9.368 -22.478 -16.625 1.00 19.42 

10.198 -23.449 -18.783 1.00 19.11 

12.450 -24.146 -15.085 1.00 16.85 

13.115 -24.982 -15.688 1.00 17.18 

11.788 -24.426 -13.964 1.00 18.20 

11.774 -25.775 -13.381 1.00 18.41 

11.902 -25.714 -11.842 1.00 18.98 

12.088* -27.126 -11.270 1.00 18.50 

13.061 -24.818 -11.449 1.00 13.33 

10.434 -26.440 -13.739 1.00 19.88 

9.371 -25.967 -13.336 1.00 20.39 

10.493 -27.532 -14.496 1.00 21.55 

9.298 -28.234 -14.948 1.00 22.11 

9.299 -28.342 -16.488 .liOO 22.50 
8.009 -29.013 -16.981 1.00 22.70 
9.470 -26.943 -17.101 1.00 21.26 
9.191 -29.639 -14.351 1.00 23.90 

10.067 -30.478 -14.559 1.00 23.61 

. 8.111 -29.887 -13.615 1 . 00 25.60 

7.914 -31.188 -12.994 1.00 27.28 

" 6.808 -32.026 -13.604 1.00*29.67 

6.668 -33.208 -13.283 1.00 29.86 

6.025 -31.430 -14.497 1.00 30.56 

4 .935- -32.163 -15.115 1.00 31.94 

3.676 -32.104 --14.269 1.00 33.11 

3.691 --31.556 -13.165 1..00 32.14 

'2.587 -32.673-14.779 1.00 34.23 

•1.313 -32.665- -14 .064 1.00 35.91 

• 0.283 -33.532 -14.301 1.00 36. 87 

0.702 -34.887 -14.877 1.00 39.58 

.1.419-33.128 -12.609 1.00 36.41 

0.8 62 -32.4 99 -11.714 . 1.00 35.7 8 

= 2 . 134 -34;225 - 12 . 380 . . 1 J 00 37.60. 

2.292-34.763 -11.033 1.00 38.53 

'2.584 -36.263 -11.096 1.00 41.10 

• 1 . 501' -37 .082 -11.779 .1.00 45.38 

0.152 -36.948 -11.099 1.00 47.81 

-0,484 -35.892 -11.149 K 00 4 9.50 

-0.290 -38.023 -10.452 1.00 49.36 

3.405 -34.072 -10.260 1.00 37.76 

3.458 -34.146- -9.030 l.'OO 37.02 

'■ 4.291- -33.398 -10.986 1.00 36.47 

5.398 -32.711-10.350 1.00 35.02 

6.584 -33.630 -10.146 1.00 34.51 

6.442 -34.351 -10.191 1.00 34.26 

7.761 -33.045 -9.938 ' T. 00 33.54 

8.977 -33.81.9 -9.709 1.00 33.12 

10.073 -33.337 -10.679 1.00 33.17 

9.423 -33.590 -8.267 1.00 32. S7 
9.955 -32.533 -7.923. 1.00 31.47 

9.195 -34.592 -7.4.26 -1.00 32.81 

9.538 -34.512 -6.010 1.00 32.63 

9.373 -35.891 -5.361 1.00 35.51 
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17.980 - 
16.4 66 ■ 
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19 . 260 
20 . 924 
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21 .4 98 
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22 . 689 
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23.237 
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22.341 
22.577 
21 . 975 
2 2.5 3.4 
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20.238 
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28 . 059 
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28 . 960 
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6.4 61 1 . 00 34.79 
5.011 1 . 00 35 . 34 
4 . 299 1 . 00 35.61 
•3.071 1.00 35.51 
•5. 038 1 . 00 34 .65 
-4.418 1.00 33.98 
-5.062 1.00 35.02 
-4.897 1.00 36.07 
-4 . 504 1 . 00 33.55 
-4 . 086 1 . 00 31.72 
-5.044 1.00 33.22 
-5 .186 1 • 00 33.71 
-5 .799 1 . 00- 35.45 
-6 . 337 1 . 00 37 . 82 
-5.249 1.00 40 . 08 
-4 .217 1 . 00 40 . 97 
-5.486 1.00 40 . 08 
-3 . 840 1 . 00 33.86 
-3. 697 1 .00 33. 38 
-2 . 850 1 . 00 33 . 06 
-1 . 519 1 . 00 32 . 32 
-0 . 640 1.00 35. 12 
-1.151 1 . 00 36.85 
-2.495 1.00 39.09 
-2.759 1 .00 39.42 
-3.345 1 . 00 40.13 
-0.788 1.00 30.73 
-0.202 1.00 30.36 
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0. 003 1 . 00 29 . 10 
-1.254 1.00 32.61 
-0.771 1 .00 26.41 
-0.073 1.00 25.24 
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-4 . 317 1 . 00 23. 32 
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-4 . 563 1'. 00 22.45 
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-2 . 623 1. 00* 26. 12 
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1..407 1.00 27.09 
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0.677 1.00 26.97 
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-0.184 1.00 25.07 
0. 192 1.00 26. 36 
0.755 1.00 25.04 
-0.854 ' 1..00 25.85 
-1 . 366 . 1 .00 25. 47 
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9.837 -15.012 -3.043 1.00 25.33 

8.720 -15.313 -3.465 1.00 24.93 

10.663 -14.197 -3.690 1^.00 24.47 

10 277 -13.558 -4.945 1.00 24.29 

11.281 -12.455 -5.306 1.00 23.76 

12.622 -12.945 -5.852 1.00 25.22 

13.535 -13.523 -4.783 1.00 26.90 

13.188 -13.550 -3.600 1.00 28.05 

14.712 -13.983 -5.195 1.00 25.61 

10.108 -14.489 -6.143 1.00 23.41 

9 485 -14.112 -7.139 1.00 20.49 

10 655 -15.700 -6.069 1.00 22.66 

10 526 -16.608 -7.201 1.00 22.03 

11*432 -17.829 -7.003 1.00 22.41 

12*891 -17.483 -6.998 1.00 20.93 

13 869 -17.774 -6.104 1.00 20.09 

13 488 -16.743 -7.992 1.00 20.30 

14 769 -16.582 -7.711 1.00 22.38 
15.025 -17.197 -6.569 1.00 19.62 
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0^ PCT/US01/11500 

10 194 -33.557 -23.657 1.00 22.34 
10.348 -33.803 -22.140 1.00 22.35 

9 061 -34.087 -21.583 1.00 24.46 

10 945 -32.573 -21.444 1.00 24.00 
11*535 -33.117 -24.226 1.00 21.15 
11.761 -31.926 -24.442 1-00 20.35 

12 427 -34.075 -24.461 1.00 20.46 

13 730 -33.762 -25.023 1.00 21.11 

14 614 -35.039 -25.114 1.00 21.54 
15*903 -34.740 -25.865 1.00 20.72 
14 938 -35.541 -23.708 1.00 20.45 

13 548 -33.138 -26.416 1.00 21.34 

14 188 -32.135 -26.747 1.00 19.99 
12 663 -33.717 -27.222 1-00 21.61 
12*411 -33.191 -28.567 1.00 21.96 

11 474 -34.121 -29.344 1.00 21.57 

12 141 -35.316 -29.721 1.00 24.06 
11*817 -31.790 -28.519 1.00 21.81 
12 158 -30.933 -29.336 1.00 22.60 

10 928 -31.563 -27.557 1.00 21.64 

10 282 -30.272 -27.378 1.00 21.95 
9 213 -30.399 -26.292 1.00 24.72 
8 480 -29.128 -25.940 1.00 27.67 
7 385 -29.380 -24.908 1.00 30.05 
6*325 -29.915 -25.237 1.00 31.50 
7 591 -29.057 -23.719 1.00 29.84 

11 321 -29.214 -26.999 1.00 21.68 

11 301 -28.095 -27.518 1.00 18.12 

12 224 -29.581 -26.092 1.00 19.43 
13*295 -28.698 -25.638 1.00 20.62 
14*157 -29.391 -24.533 1.00 20.30 
15 415 -28.595 -24.266 1.00 19.17 

13 337 -29.574 -23.254 1.00 21.32 
12.926 -28.291 -22.583 1.00 23.40 

14 214 -28.314 -26.806 1.00 20.58 
14*595 -27.151 -26.954 1.00 20.50 

14 574 -29.298 -27.624 1.00 21.29 

15 440 -29.058 -28.776 1.00 22.45 
15 741 -30.376 -29.485 1.00 23.36 

14 766 -28.084 -29.745 1.00 23.03 

15 400 -27.156 -30.259 1.00 23.57 
13 479 -28.301 -29.988 1.00 21.81 
12 719 -27.451 -30.898 1.00 22.17 

11 *. 335 -28.053 -31.131 1.00 22.15 

12 590 -26.030 -30.355 1.00 22.78 
12*585 -25.058 -31.122 1.00 21.77 

12 474 -25.907 -29.034 1.00 21.06 
12*361 -24.596 -28.407 1.00 21.97 
11 919 -24.737 -26.949 1.00 20.63 

13 699 -23.867 -28.468 1.00 21.83 

13 754 -22.642 -28.344 1.00 22.67 

14 773 -24.621 -28.656 1.00 20.90 

16 093 -24.023 -28.709 1.00 21.07 
16 498 -23.549 -27.327 1.00 21.48 
16 961 -22.421 -27.154 1.00 20.01 
16 331 -24.420 -26.333 1.00 19.66 

16 666 -24.057 -24.966 1.00 19.78 

15 402 -24.068 -24.102 1.00 21.30 
14*451 -22.870 -24.202 1.00 24.60 
13*220 -23.136 -23.349 1.00 26.44 
15*159 -21.609 -23.725 1.00 24.02 
17*698 -24.970 -24.318 1.00 18.79 

17 679 -26.180 -24.524 1.00 19.36 
18*634 -24.389-23.554 1.00 17.27 
18925 -22.955 -23.416 1.00 16.95 
19*651 -25.199 -22.875 1.00 16.62 
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4 647 
4648 
4 649 
4 650 
4 651 
4 652 
4 653 
4 654 
4 655 
4 656 
4 657 
4658 
4 659 
4660 
4661 
4 662 
4663 
4664 
4665 
4666 
4667 
4668. ' 
4669 
4670 
4671 
4672 
4673 
4 67 4 
4675 
4 676 
4 677 
4678 
4 67*9 
4 680 
4681 
4 682 
4 683 
4684 
4685 
4686 
4687 
4688 
'4 689 
4 690 
4691 
4692 
4693 
4694 
4 695 
4696 
4697 
4698 
4 699 
4 700 
4701 
- 4702 
4703 
4 7 0.4 
4705 
4706 
4707 
4708 
4709 
4 710 
. 4711 
4 712 



CB PRO B 276 
CG PRO B 276 
C PRO B 276 
0 PRO B 276 
N ■ ALA B 277 
CA ALA B 27 7 
CB ALA B 277 
C ALA B 277 
O " ALA B 277 
N LEU B 278 
CA LEU B 278 
CB LEU B 278 
CG LEU B 278 
CD1 LEU B 278 
CD2 LEU B 278 
C LEU B 278 
O LEU B 278 
N PHE B 279 
CA PHE B 27 9 
CB PHE B 279 
CG PHE B 279 
C01 PHE B 279 
CD2 PHE B 279 
CE1 PHE B 279 
CE2 PHE B 279 
CZ PHE B 279 
C PHE B 279 
O PHE B 279 
N VAL B 280 
CA VAL B 2 80 
CB VAL B 280 
CGI VAL *B 280 
CG2 VAL B 280 
C VAL B 280 
O VAL B 230 
N PRO B 281 
CD PRO B 281 
CA PRO B 281 
CB PRO B 281 
CG -PRO B 281 
" c PRO B 281 
O PRO B 281 
N PHE B-282 
CA PHE B 282 
CB * PHE B. 282 
CG PHE' B 282 
CD1 PHE B. 232 
CD2 PHE B 282 
CE1 PHE B .282 
CE2 PHE B 282 
CZ PHE B 282 
C PHE 3 282 
„0 PHE B 282 
N GLN B 283 
CA GLN B • 283 
CB GLN' B 283 
CG GLN B 28 3 
CD GLN B 283 
OE1 GLN B 283 



NE2 GLN B 283 



C ' 

O 

N ' 

CA 

CB 

CG 



GLN B 283 
GLN B -283 
HIS .B 284 
HIS B-284 
HIS. B 284 
HIS B 284 



20-619 
20. 408 
18 . 900 
17 . 944 
19.316 
18.638 
17 . 641 
19. 591 
20.710 
19.147 
19.896 
20. 140 
21 . 084 
21.283 
20 . 497 
18 .884 
17 . 870 
19. 149 
18 .266 
18 . 385 

17 .740 

18 . 481 
16.379 
17.874 
15.759 
16. 515 
18. 525 
19.671 
17 . 445 
17 . 502 
16.883 
16. 954 
17-. 631 
16. 690 

. 15.509 
17.324 
18.750 
16.698 
17^851 
18.791 
16.092 
16. 675 
14 . 908 
14.246 
12 . 818 
12. 032 
11/720 
11 . 590 
10. 975 
10.843 
10. 536 
15.078 
15. 357 
15. 492 
16.319 
17.010 
17.953 
19. 143 
20 . 199 
18 . 969 
15.543 
14 . 382 
16.211 
15 . 641 
14 . 508. 
13.795 
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24.162 • 
22.957 
25.900 
25.340 
•27 . 110 
•27 .807 
-28.805 
-28.526 
-28.891 
-28.673 
-29.429 
-28 . 619' 
-29 . 308 
-28.411 
-30. 647 
-30. 535 
-30. 330 
-31. 691 
-32 .838 
-33.700 
-33.099 
-32 .898 
-32.794 
-32. 405 
-32.298 
-32 . 108 



-33.709 
-34 .065 
-34 . 037 
-34 . 902 
-34 . 223 
-35. 159 
-32 . 929 
-36. 136 
239 



-3 

-37 . 080 
-37 .057 
-38.320 
-39.071 
-37 . 992 
-39.121 
-39. 223 
-39. 668 
-40 . 496 
-40 . 808 
-41 . 606 
-41.074 
-42.886 
-41.806 
-43. 628 
-43.085 
-4 1 . 776 ■ 
-42. 373 
-42. 197' 
-43. 395 
-43.477. 
-44.. 676 
-44 . 605 
-44 . 063 
-45. 135 
-4 4 . 67 9 
-44-. 809 
-45. 618 
-46. 917 
-46. 760 
-48 .039 



22. 317 
23. 195 
21.746 
21.. 189 
21. 396 
20. 321 
•20. 895 
-19. 382 
•19.755 
-18 .138 
-17.145 
-15.869 
-14 . 368 
-13. 668 
-14 .433 
-16.898 
-16.218 
-17 . 495 
-17 . 392 
-18 . 651 
-19.876 
-21 . 035 
-19.888 
-22 . 203 
-21. 052 
-22 . 208 
-16. 167 
-15.871 
-15. 461 
-14 . 281 
-13. 048 
-11.847 
-12.742 
-14 . 658 
-14 . 346 
-15. 370 
-15.726 
-15.824 
-16.4 92 
-16 .895 
-14 .68 4 . 
-13. 603 
-14 . 923 
-13. 926 
-14 . 372 
-13 . 377 
-12 . 130 
-13 . 689 
-11 . 209 
-12 .773 
-11 . 532 
-13 . 380 
-14 . 921 
-12 . 690 
-12 . 591 
-11.223 
-11 . 096 
-12 . 051 
-11 .7.12 
-13 . 254 
-12 .846 
-12 . 462 
-13.511 
-13.343 
-14 . 853 
15. 165 



1.00 17.27 
1.00 18.59 
1.00 17.23 
1.00 15.14 
1.00 17,01 
1.00 15.80 
1.00 17.01 
1.00 17.37 
1.00 17.09 
1.00 17.14 
1 . 00 18.48 
1.00 19.19 
1.00 20.85 
1.00 21.11 
1.00 19.16 
1.00 19.62 
1.00 20.77 
1.00 19.50 
.1.00 21.59 
1.00 21.07 
1.00 19.35 
1.00 19.42 
1.00 18.16 
1.00 19.06 
1.00 17.65 
1.00 15.61 
1.00 22.86 
1.00 2 3.32 
1.00 23.88 
1.00 25.67 
1.00 26.89 
1.00 28.12 
1.00 27.70 
1.00 25.65 
1.00 24.57 • 
1 . 00 27 . 03 
1.00 27.31 
1.00 29.05 
1.00 29.44 
1.00 29.67 
. 1:00 31.51 
1.00 '32.26 
1.00 33.83 
1.00 37.13 
1-..00 38.38 
- 1 . 00 40.57 
1.00. 4 1.80 
1 . 00 41.65 
•1.00 42.03 
i>.00 42.49 
1.00 41.7 4. 
1 . 00 38 . 09 
1.00 38 . 33 
1 . 00 39 . 70 
1.00 41.11 
1.00 41.41 
1.00 42.60 
1.00 42.87 
1.00 43.66 
1 . 00 42.46 
1.00 41.60 
1.00 40.91 
.1.00 42.92 
1.00 43.69 
1.00 43. 91 
' 1.00 44.7 1 
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19.477 -45.650 -17.910 1.00 37.28 

20.116 -46.263 -16.687 1.00 40.54 

21.605 -46.395 -16.906 1.00 42.34 

22.291 -45.113 -16.802 1.00 44.33 

23.449 -44.842 -17.392 1.00 45.20 

24.045 -45.764 -18.135 1.00 45.60 

24.019 -43.656 -17.227 1.00 45.68 

17.449 -45.005-19.180 1.00 33.59 

18.167 -44.255 -19.844 1.00 32.61 

16.212 -45.321 -19.533 1.00 30.94 

15.622 -44.804 -20.755 1.00 30.77 

14.143 -45.158 -20.810 1.00 30.59 

13.473 -44.772 -22.109 1.00 29.73 
11.981 -44.971-22.04 4 1.00 28.04 
11.294 -44.295 -21.279 1.00 29.59 
11.468 -45.905 -22.838 1.00 26.98 
15.783 -43.291 -20.885 1.00 29.70 
16.268 -42.801 -21.902 1.00 29.79 
15.378 -42.554 -19.857 1.00 29.38 

15.474 -41.099 -19.904 1.00 29.46 
14.772 -40.472 -18.700 1.00 29.25 
13.265 -40.416 -18.883 1.00 29.32 
12.575 -39.585 -17.826 1.00 29.84 

13.191 -38.728 -17.183 1.00 29.52 
11.281 -39.821 -17.647 1,00 28.95 
16.906 -40.613 -20.005 1.00 29.36 
17.173 -39.557 -20.585 1.00 29.12 
17.835 -41.374 -19.442 1.00 28.95 
19.228 -40.984 -19.550 1.00 29.55 
20.136 -41.934 -18.768 1.00 31.40 
21 587-41.780 -19.148 1.00 33.37 
22.332 -40.682 -18.717 1.00 34.57 
23 644 -40.490 -19.148 1.00 35.97 

22.192 -42.684 -20.017 1.00 34.90 
23.497 -42.500 -20.453 1.00 36.03 
24^214 -41.402 -20.019 1.00 36.29 
25.499 -41.215 -20.475 1.00 39.44 
19.593 -41.042 -21.032 1.00 28.80 
20192 -40.113 -21.567 1,00 29.22 
19 227 -42.135 -21.697 1.00 27.57 
19 542 -42.282 -23.116 1.00 28.07 
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4814 
4815 
4816 
. 4817 
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4821 
4822 
4323 
4824 
4825 
4826 
4827 
4823 
' 4829 
4830 
4831 
4832 
4833 
4834 
4835 
4836 
4337 
4838 
4839 
* 4.840 
484 1 
484 2 
4843 
484 4 



CB 
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CD2 
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CZ2 
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C 
O 
N 

CA 
CB 
CG 
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C 
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N 

C/> 
CB 
C 
O 
N 

CA 
CB • 
CG 
CD1 
CD2 
C 
O 
N 

CD 
CA J 
CB 
CG 
C 
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N 

CA 
CB 
CG 
CD1 
CD2 
C 
O 
N 

CA 
'CB 
.CG 
CD 
OE1 
OE2 
C* 
O 

N ' 
CA 
CB 
CG 
CD 
CE 
NZ 
C 
O 



TRP 
TRP 
TRP 
TRP 
TRP 
TRP 
TRP 
TRP 
TRP 
TRP 
TRP 
TRP 
ASN 
ASN 
ASN 
ASN 
ASN 
ASN 
ASN 
ASN 
ALA 
ALA 
ALA 
ALA 
ALA 
' LEU 
LEU 
LEU 
LEU 
LEU 
LEU 
LEU 
LEU 
PRO 
PRO 
PRO 
PRO 
PRO 
PRO 
. PRO 
LEU 
LEU 
LEU 
LEU 
LEU 
LEU 
LEU 
LEU 
GLU 
GLU 
GLU 
GLU 
GLU 
GLU 
GLU 
GLU- 
GLU 
LYS 
LYS 
LYS 
LYS 
LYS 
LYS 
LYS 
LYS 
LYS 



B 291 
B 291 
B 291 
B 291 
B 291 
B 291 
B 291 
B 291 
B 291 
B 291 
B 291 
B 291 
B 292 
B 292 
B 292 
B 292 
B 292 
B 292 
B 292 
B 292 
B 293 
B 293 
B 293 
B 293 
B 293 
B 294 
B 294 
B 294 
B 294 
B* 294 
B 294 
B 294 
B 2 94 
B 295 
B 295 
B 295 
B 295 
B 295 
B 295 
B 295 
B 296 
B 296 
B 296 
B 296 
B 296 
B 296 
B 296 
B 296 
B 297 
B 2 97 
B 297 
B 297 
B 297 
B 297 
B 297 
■ B 297 
B 297 
B 298 
B 298 
B 2 98 
B- 298 
B 293 
B 298 
B -2 98 
B 298 
B 2-98 



19.217 
20 . 070 
21 .487 
21 .873 
22 . 470 
19. 663 
20.741 
23 . 203 
23. 794 
24.146 
18 . 808 
19. 283 
17 . 658 
16 . 902 
15.484 
14 . 590 
14 . 842 
13. 523 
17 . 605 
17 . 566 
18 .242 
18 . 926 
18 . 940 
20 . 346 
20 . 855 
20 . 978 
22 . 354 
22 . 745 
24 .226 
25 . 085 
24 . 470 
22 . 644 
23. 677 
21.748 
20 . 560 
21. 998 
20 . 740 
20.382 
22 .219 
23. 138 
21.375 
21. 521 
20.283 
19. 066 
17 . 968 
19.. 496 
22.784 
23.435 
23 . 143 
24 . 354 
24 . 470 
25.716 
25. 812 
25. 655 
26.045 
25.577 
26. 528 
25.543 
26. 644 
26. 379 
26.519 
26.251 
26. 461 
26. 094 
26. 829 
27 ;945 



-43.705 
-44 .750 
-44 . 908 
-45.966 
-44 . 257 
-45. 695 
-46. 428 
-46. 387 
-44 . 676 
-45.732 
-41 . 256 
-40. 900 
-40.779 
-39.784 
-39. 599 
-40.811 
-41 . 641 
-40 . 900 
-38 .427 
-37 .687 
-38 . 105 
-36.822 
-36.422 
-36.800 
-35.743 
-37 .969 
-38 .088 
-39.566 
-39.864 
-39.05.5 
-41.368 
-37 . 392 
-36.741 
-37 . 532 
-38 . 400 
-36. 870 
-37 . 201 
-38. 548 
-35. 358 
-34 .792 
-34 .703 
-33. 265 
-32. 685 
-32. 458 
-31.718 
-31.630 
-32 . 935 
-31.917 
-33.785 
-33. 530 
-34 . 483 
-34 .210 
-35.087 
-34 . 555 
-36'. 307 
-33 .686 
-32 .910 
-34 . 688 
-34 .947 
-36 .216 
-37 .514' 
-38.734 
-40.033 
-41.235 
-33.768' 
-33.455 



23. 599 
22.936 
23. 069 
•22.213 
•23.326 
-22.035 
-21 . 595 
-22 . 093 
-23. 707 
-22 .845 
-23. 978 
-25.053 
-23 . 508 
-24 .270 
-23.709 
-23. 928 
-24 .798 
-23 . 146 
-24 .258 
-25.244 : 
-23. 139 
-22. 979 
-21.506 
-23. 521 
-23. 902 
-23. 560 
-24 .032 
-24 ,121 
-24 . 350 
-23. 380 
-24 . 169 
-25. 359 
-25. 507 
-26. 351 
-26.437 
-27 . 635 
-28 . 439 
-27 . 932 
-27 .480 
-28 . 069 
-26. 688 
-26.481 
-25 . 779 
-26.679 
-25 . 911 
-.27 .8 93 
-25. 688 
-25 . 944 
-24 .730 
-23.953 
-22 .755 
-21 . 908 
-20 . 671 
-19.551 
-20.820 
-24 .34 6 
-24 .74 6 
-25.722 
-26 . 648 
-27 . 464 
-26. 680 
-27 . 568 
-26. 801 
-27 . 607 
-27. 598 
-27 . 993 
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1 . 00 
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00 
00 
00 
00 
00 
00 
00 
00 
00 
1.00 
1 .00 
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1 . 

1 . 

l\ 

1 . 

1 . 

1 . 

1 

1 

1 

1 

1 

1 

1 
1 
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00 
00 
00 
00 
00 
00 
00 
00 
00 
00 
00 
. 00 
.00 
.00 
.00 
.00 



1 . 00 

-1 ,00 



29. 42 

31 .82 
33.22 
33.81 
33. 13 

32 . 62 
33.43 

34 . 69 

35 . 4 1 
35 .26 
26.84 
26. 55 
25 . 52 
26. 06 

24 .78 
24.46 
25.33 
23.83 

25 . 99 
26.18 
25.66 
25 . 69 

24 . 17 
25. 67 

25 . 52 
25. 62 

25. 90 

26. 51 
28. 12 
28 .29 
28.15 
25. 66 
24 .79 
26.24 
25.75 
26.15 • 
26.70 
26.84 
25.78 
2 5 . 0 5 
25. 66 
25.42 
24.45 

25. 59 
23.56 
25.87 
26.07 
25.26 

26. 14 
28 . 78 
29.05 
30. 81 
31.84 
31.74 
31.56 
29.06 
29. 11 
31.21 
31.90 
33.65 
36.05 

37 . 69 

38 . 51 
39.05 
31.69 
30.87 
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BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 
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BBBBATOM 
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BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 



4845 

4846 

4 84 7 

4848 

484 9 

4850 

4851 

4852 

4853 

4854 

4855 

4856 

4857 

4858 

4859 

4860 

4861 

4862 

4863 

4864 

4865 

4866 

4867 ' 

4868 

4869 

4870 

4871 

4872 

4873 

4874 

4875 

4876 

4877 

4878 

4879 

4880 

4881 

4882 

4883 

4884 

4885 

488 6 

4887 

4888 

4889 

4890 

4891 

4892 

4893 

4894 

4895 

4896 

4897 

4898 

4899 

4900 

4901 

4902 

4903 

4904 

4905 

4906 

4907 

4908 

4909 

4910 



N 

CA 
CB 
C 
O 
N 

CA 
C 
O 
N 

CA 
CB 
C 

o 

N 

CA 
CB 
C 
O 
N 

CA 
.CB 
CG 
CD 
CE 
NZ 
C 
O 
N 

CA 
CB 
CG2 
CGI 
CD1 
C 
O 
N 

CA 
CB 
CG2 
CGI 
CD1 
C 
O 
N 

CA 
CB 
CG 
CD 
OEl 
OE2 
C 
O 
N 

CA 
CB 
CG 
CD 
OEl 
NE2 
C 
0 
N 

CD 
CA 
CB 



ALA B 
ALA B 
ALA B 
ALA B 
ALA B 
GLY B 
GLY B 
GLY B 
GLY B 
ALA B 
ALA B 
ALA B 
ALA B 
ALA B 
ALA B 
ALA B 
ALA B 
ALA B 
ALA B 
LYS B 
LYS B 
LYS B 
LYS B 
LYS B 
LYS B 
LYS B 
LYS B 
LYS B 
ILE B 
ILE B 
ILE B 
ILE B 
ILE B 
ILE B 
ILE B 
ILE B 
ILE B 
ILE B 
ILE B 
ILE B 
ILE B 
ILE B 
ILE B 
ILE B 
GLU B 
GLU B 
GLU B 
GLU B 
GLU B 
GLU B 
GLU B 
GLU B 
GLU B 
GLN B 
GLN B 
GLN B 
GLN B 
GLN B 
GLN B 
GLN B 
GLN B 
GLN B 
PRO B 
PRO B 
PRO B 
PRO B 



299 
299 
299 
299 
299 
300 
300 
300 
300 
301 
301 
301 
301 
301 
302 
302 
302 
302 
302 
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304 
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304 
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305 
305 
305 
305 
305 
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305 
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306 
306 
306 
306 
307 
307 
307 
307 
307 
307 
307 
307 
307 
308 
308 
308 
308 



25.725 
25.773 
24.415 
26. 173 
26. 278 
26. 398 
26.777 
25 . 619 
25.832 
24 . 392 
23.214 
22 . 174 
22 .591 
21 . 705 
23 . 066 
22 .516 
21 . 243 
23.503 
24 . 561 
23. 156 
23. 979 
24 . 632 
25 . 466 
26. 150 
27 . 083 
27 . 827 
23. 083 
22 . 015 
23. 520 
22.753 
22.786 
21.977 
22.242 
22 . 380 
23. 308 
24 . 511 
22 . 428 
22.843 
22.713 
23 . 063 
23 . 660 
23. 674 
21 . 934 

20 . 806 
22.429 
21.664 
22 . 356 
22 . 529 
23.114 
22 . 468 
24.214 

21 . 531 
22.241 
20.612 
20.377 
19. 397 
18 . 896 

17 . 879 

18 . 181 
16. 659 
21.647 
21.796 

22 . 580 
22.456 
23.828 
24 .533 



-33.1-16 
-31 . 965 
-31.778 
-30. 685 
-29. 615 
-30.796 
-29. 635 
-28. 665 
-27 .4 66 
-29. 179 
-28 . 333 
-28 . 644 
-23 . 510 
-27 .746 
-29. 504 
-29.770 
-30. 595 
-30. 507 
-30. 948 
-30. 613 
-31. 340 
-30. 401 
-31. 146 
-30. 186 
-30. 912 
-29. 952 
-32.319 
-31 . 948 
-33. 570 
-34 . 598 
-35.946 
-36. 996 
-35.769 
-37 .009 
-34.855 
-35 .012 
-34 .869 
-35. 178 
-33.977 
-34 .416 
-32.855 
-31 . 653 
-36.302 
-36.067 
-37.532 
-38 .702 
-39 . 989 
-40 .067 
-4 1 . 388 
-42 .4 33 
-41.378 
-38 .704 
-37 . 984 
-39. 514 
-39.599 
-40.734 
-40.769 
-41 . 867 
-43.051 
-41.481 
-39.784 
-39. 154 
-40. 648 
-41.593 
-40.891 
-41.954 



-27 . 952 
-28 .84"? 
-29. 521 
-28 . 106 
-28 . 709 
-26.801 
-26.017 
-25.810 
-25. 644 
-25. 820 
-25. 638 
-26.717 
-24 .254 
-23.863 
-23.507 
-22 . 186 
-22 . 327 
-21 . 288 
-21 .739 
-20.009 
-19.048 
-18.036 
-16.986 
-16.025 
-15.056 
-14 . 181 
-18 . 302 
-17 .802 
-18 .234 
-17 .550 
-18 . 316 
-17 . 555 
-19.733 
-20. 599 
-16. 160 
-15. 986 
-15. 168 
-13. 813 
-12.858 
-11.432 
-13.299 
-12. 367 
-13. 351 
-12.932 
-13. 467 
-13.061 
-13. 530 
-15.035 
-15. 504 
-15.289 
-16.099 
-11. 552 
-10.858 
-11 .048 
-9.613 
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1 .00 
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1 . 00 
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1 . 00 
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-3.776 
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1 .00 
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1 . 00 
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i : oo 

1 . 00 
1.00 
1 . 00 
1.00 
1.00 
1 . 00 
1. 00 
1. 00 
1 .00 
1.00 



00 
00 
00 
00 
00 



31 . 63 
3.0. 38 • 
30.22 
29.94 
30. 96 
27 . 41 
26.18 
25. 19 
24 . 70 
23.51 
22 . 50 
23.44 
22 . 09 

19 . 90 
20.94 

21 .78 

20 . 10 

22 . 69 
22 .25 

24 . 62 
25.86 
27 .85 
29.37 
32.41 
33.22 
33.62 
26.41 
25.76 

25 . 65 
27 . 17 
27 . 06 
28.49 
27 . 61 
27.05 
27 . 00 
27 . 4 6 
27 . 22 
29.01 

28. 91 
30. 98 

29. 51 
29.43 
29. 64 
29. 25 
32. 18 
34 . 65 
34 .79 
36. 39 
36.86 
36. 59 
37.75 
35. 50 
35.55 
38 . 41 
40. 54 
40. 99 
42.76 
44 . 33 
44.46 
45. 30 
40 . 94 
41.24 
42.16 
42.40 
43.20 
43.16 
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4911 
4912 
4913 
4914 
4915 
4916 
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4918 
4919 
4920 
4921 
4922 
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4934 
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4940 
4 94 1 
4 942 
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4 945 
4 94"6 
4947 
4948 
4 94 9 
4950 
4951 
4952 
4 953 . 
4 95 4 
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4956 
4957 
4958 
4959 
4960 
4961 
4962 
4963 
4964 
4965- 
4966 • 
4967 
4968 
4969 
4970 
4971 
4972 
4973 
4974 
4975 
4976 
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C 
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N 

CA 
CB 
CG 
CD 



PRO B 308 

PRO B 308 

PRO B 308 

GLN B 309 

GLN B 309 

GLN B 309 

GLN B 309 

GLN B 309 



OE1 GLN B 309 
NE2 GLN B 309 
C . GLN B 309 
O GLN B 309 
N LEU B 310 
CA LEU B 310 
CB LEU B 310 
CG LEU B 310 
CD1 LEU B 310 
CD2 LEU B 310 
C LEU B 310 
O LEU B 310 
N SER B 311 
. Ch SER B 311 
CB SER B 311 
OG SER B 311 
C SER B 311 
O . SER B 311 
N . VAL B -312 
CA VAL B 312 
CB VAL B 312 
CGI VAL B 312 
CG2 VAL B 312 
C VAL B-312 
O VAL B- 312 
N ASP B 313 
CA ASP B 313 
CB ASP B 313 
cg: ASP B 313- 
OD1 ASP B 313 
OD2- ASP B 313 
C ASP B 313 
O ASP B.313 
N ' . ALA ■ B 314 
CA. ALA B.314 
CB ALA B 314 
C ALA B 314 
O ALA B 314 
N VAL B 315 
CA VAL. B 315 
• CB VAL B 315 
CGI VAL B 315 
CG2 VAL B 315 
C VAL B 315 
O . VAL B 315 
N 'ALA B 316 
CA ALA B 316 
CB ALA B 316 
C ALA B 316 
0 ALA B 316 
N - ASN B 317 
CA ASN B 317 
■ CB . ASN B 317 
CG ASN B 317 
ODI ASN B '317 
ND2 ASN B 317 
C ' ASN B 317 
O ASN B 317 



23.395 - 
24 .719 - 
25.652 - 
24 . 442 - 
25.247 - 
26.071 - 
27 . 183 - 
26.683 - 
27 . 459 - 
25.388 - 
24.4 17 - 
24.955 - 
23.113 - 
22.232 - 
20.779 ■ 
19.730 • 
19.545 
20. 160 
22. 342 
22.246 
22.541 
22. 660 
24 .059 
25.022 
22.434 
22.476 
22.202 
21 . 990 
21 .707 
21 .546 
20 .445 
23. 228 
23.123 
24 .406 
25.642 
26.840 
26.817 
26. 662 
26. 958 
25.910 
26. 442 
25. 555 
25. 782 
25.441 
24 . 928 
25.412 
23. 661 
22.755 
21.305 
20 . 392 
20 . 810 

23 . 222 
23 . 398 
23.445 
23.888 

24 .059 

25 . 185 
25.289 
26. 178 

27 . 44 4 

28 . 493 
28 . 940 
29.091 
29. 170 
27 . 270 
27 . 887 



42 . 708 
39. 668 
39.724 
38.569 
37 .361 
37.104 
38.113 
39.437 
•40 . 376 
-39.515 
-36. 119 
-35.013 
-36.297 
-35 . 166 
-35.549 
-34 .437 
-33. 699 
-33.472 
-34 . 659 
-35. 428 
-33. 354 
-32.714 
-32.928 
-32 . 135 
-31.227 
-30. 741 
-30 . 509 
-29. 074 
-28 . 482 



-26. 976 
-29. 106 
-28 .417 
-27 . 612 
-28 .780 
-28 .202 
-28 . 656 
-28 .085 
-26.855 
-28 .865 
-28 .516 
-27 . 677 
-29.719 
-30 .099 
-31 . 575 
-29.220 
-28.714 
-29.046 
-28 .215 
-28 .298 
-27 . 382 
-29.738 
-26.764 
-26.125 
-26.251 
-24 .872 
-24 .54 5 
-24 .584 
-23.572 
-25.463 
-25.246 
-26.271 
-26.068- 
-24 .933 
-27 . 171 
-25.310 
-24 .543 



-9. 938 
-8.250 
-7.449 
-8 . 944 
-8.787 
10.056 
10.329 
10.885 
■11 .054 
■11 . 179 
-8 .479 
-8 .404 
-8.289 
-8.022 
-8 . 330 
-8 . 480 
-7 . 166 
-9. 580 
-6. 591 
-5. 634 
-6.457 
-5.154 
-4 . 569 
-5.248 
-5. 357 
-6.485 
-4 . 266 
-4 .341 
-2.938 
-3 .023 
-2 . 362 
-4 . 94 6 
-5.875 
-4.444 
-4 . 957 
-4 . 120 
-2 .718 
-2.578 
-1.751 
-6.425 
-7.-146 
-6.868 
-8.254 
' - 8 .4 60 
-9.174' 
-10. 187 

-8.815 - 
. -9.612 
-9.088 
. -9.898 
-9.172 
-9.561 
-10 . 590 
-8 . 359 
-8 . 199 
-6.714 
. -8.955 
-9.652 
-8.323 
-9.518 
-9.081 
-7.645 
-7.183 
-6.933 
-11 .026 
-11.765 



1 .00 43. 10 
1.00 43.82 
1 . 00 44.24 
1 . 00 4 4.04 
1.00 43.46 
1.00 44.98 
1.00 46.93 
1 .00 47 .93 
1 . 00- 48.87 
1.00 49.17 
1.00 42.87. 
1.00 43.33 
1 .00 41 .06 
1.00 39.65 
1.00 39.95 
1.00 39.98 
1.00 41.39 
1 .00 40.19 
1.00 38.83 
1.00 39.25 
1.00 36.17 
1.00 34.90 
1 .00- 35.08 
1.00 33.92 
1.00 33.51 
1.00 33.60 
1.00 32.42 
1.00 31.50 
1.00 31.69 
1.00 31.54 
1.00 31.93 
1.00 31-40 
1.00 30.04 
1.00 30. 18 
1.00 29.61 
1.00 32.36 
1.00 34.21 
1.00 36.52 
l.OO 38.92 
1.00 28.84 
1.00' 23.96 
1.00 26.61 
1.00 26.47 
1.00 26.05 
1.00 25.78 
1.00 24.94 
' 1.00 25.00 
1.00 25.33 
1.00 24.74 
1.00 23.53 
1* 00 25.69 
1.00 25.02 
1.00 25.73 
1.00 26.04 
1.00 27.13 
1.00 27.12 
1.00 28.17 
1.00 27,04 
1.00 23.72 
1.00 28.52 
1.00 31.30 
1.00 34.01 
1.00 33.87 
1.00 35.63 
1 .00 27.23 
1.00 26.36 
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N 


THR 


B 
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CA 


THR 


B 


318 
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THR 


B 
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BBBBATOM 
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OG1 


THR 


B 
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CG2 


THR 


B 
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BBBBATOM 
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C 


THR 


B 
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4 983 


O 
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B 
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N 
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B 
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CA 
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B 
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LEU 


B 
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4987 


CG 
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B 
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BBBBATOM 
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CD1 


LEU 


B 


319 


BBBBATOM 


4 989 


CD2 


LEU 


B 


319 


BBBBATOM 


4990 


C 


LEU 


B 


319 


BBBBATOM 


4991 


O 


LEU 


B 


319 


BBBBATOM 


4992 


N 


ALA 


B 


320 


BBBBATOM 


4993 


CA 


ALA 


B 


320 


BBBBATOM 


4994 


CB 


ALA 


B 


320 


BBBBATOM 


4995 


C 


ALA 


B 


320 


BBBBATOM 


4996 


O 


ALA 


B 


320 


BBBBATOM 


4997 


N 


GLY 


B 


321 


BBBBATOM 


4998 . 


CA, 


GLY 


B 


321 


BBBBATOM 


4999 


C 


GLY 


B 


321 


BBBBATOM 


5000 


o 


GLY 


B 


321 


BBBBATOM 


5001 


N 


TRP 


B 


322 


BBBBATOM 


5002 


CA 


TRP 


B 


322 


BBBBATOM 


5003 


CB 


TRP 


B 


322 


BBBBATOM 


5004 


CG 


TRP 


B 


322 


BBBBATOM 


5005 


CD2 


TRP 


B 


322 


BBBBATOM 


5006 


CE2 


TRP 


B 


322 


BBBBATOM 


5007 


CE3 


TRP 


B 


322 


BBBBATOM 


5008 


CD1 


TRP 


B 


322 


BBBBATOM 


5009 


NE1 


TRP 


B 


322 


BBBBATOM 


5010 


CZ2 


TRP 


B 


322 


BBBBATOM 


5011 


C23 


TRP 


B 


322 


BBBBATOM 


5012 


CH2 


TRP 


B 


322 


BBBBATOM 


5013 


C 


.TRP 


B 


322 


BBBBATOM 


5014 


O 


TRP 


B 


322 


BBBBATOM. 


5015 


N 


SER 


3 


323 



BBBBATOM 5016 CA SER B 32 3 

BBBBATOM 5017 CB SER B 323 

BBBBATOM 5018 OG SER B 323 

BBBBATOM 5019 C SER B 323 

BBBBATOM 5020 O SER B 323 

BBBBATOM 5021 N ARG B 32 4 

BBBBATOM 502 2 CA ARG B 32 4 

BBBBATOM 5023 CB ARG B 324 

BBBBATOM 5024 CG ARG B 324 

BBBBATOM 5025 CD ARG B 324 

BBBBATOM 5026 NE ARG B 324 

BBBBATOM 5027 CZ ARG B 32 4 

BBBBATOM 5028 NH1 ARG B 324 

BBBBATOM 5029 NH2 ARG B 324 

BBBBATOM 5030 C ARG B 324 

BBBBATOM 5031 O ARG B 324 

BBBBATOM 5032 N GLU 3 325 

BBBBATOM 5033 CA GLU B 325 

BBBBATOM 5034 CB GLU B 32 5 

BBBBATOM 5035 CG GLU B 32 5 

BBBBATOM 5036 CD GLU B 325 

BBBBATOM 5037 OE1 GLU B 325 

BBBBATOM 5038 OE2 GLU B 325 

BBBBATOM 5039 C GLU B 325 

BBBBATOM 5040 O GLU B 325 

BBBBATOM 504 1 N THR B 326 

BBBBATOM 5042 CA THR B 326 



PCT/US0 1/11500 

26.424 -26.224 -11.482 1.00 26.95 

26.174 -26.371 -12.906 1.00 27.04 

25.203 -27.537 -13.189 .1.00 27.91 

25.843 -28.779 -12.872 1.00 30.24 

24.784 -27.539 -14.655 1.00 27.49 

25.579 -25.087 -13.480 1.00 26.50 

26.085 -24.548 -14.465 1.00 26.71 

24.516 -24.588 -12.859 1.00 24.75 

23.883 -23.370 -13.357 1.00 25.21 

22.579 -23.090 -12.600 1.00 23.70 

21.496 -24.153 -12.803 1.00 23.20 

20.384 -23.962 -11.785 1.00 22.93 

20.963 -24.065 -14.237 1.00 22.39 

24.814 -22.169 -13.253 1.00 25.23 

24.888 -21.368 -14.168 1.00 25.17 

25.540 -22.055 -12.147 1.00 24.41 

26.445 -20.931 -11.957 1.00 24.59 

27.069 -20.987 -10.547 1.00 24.46 

27.549 -20.865 -13.010 1.00 24.21 

28.091 -19.800 -13.281 1.00 24.35 

27.878 -22.005 -13.601 1.00 24.35 

28.934 -22.031 -14.591 1.00 24.34 

28.509 -21.642 -15.993 1.00 24.34 

29.345 -21.585 -16.890 1.00 24.63 

27.225 -21.357 -16.188 1.00 22.62 

26.733 -21.007 -17.521 1.00 21.72 

25.499 -21.841 -17.858 1.00 20.09 
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BBBBATOM . 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM , 
'BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM' 
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BBBBATOM 
BBBBATOM 
BBBBATOM 
■BBBBATOM * 
BBBBATOM 
BBBBATOM 
BBBBATOM 
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PCT/USO 1/1 1500 
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CB 
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N' 
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SD 
CE 
C 

O ' 
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O 
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CA 
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c 
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THR 
THR 
THR 
THR 
THR 
LEU 
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LEU 
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ARG 
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B 333 
B 333 
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B 333 
B 333 
B 333 
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B 334 
B 334 
B 334 
B 334 



27 . 9 67 ' 
29 . 34 6 
27 .804 
26.034 
25.817 
25 . 068 
23.659 
22.791 
22 . 922 
22. 192 
22. 361 
23. 145 
22 . 352 
23. 584 
23. 175 
23.706 
23.475 
21 . 988 
23. 984 
23.716 
23.003 
24 . 977 
25. 567 
27 . 069 
27 . 866 
27 . 533 
24.819 
24 . 552 
24.494 
23. 771 
23. 642 
24 . 957 
24 . 805 
24.118 
22. 389 
21 . 924 
21 . 7 37 
20. 4 12 
19. 868 
20. 481 
19. 644 
21.489 
21 . 626 
22 . 709 
22 . 328 
23.428 
24 . 594 
23.118 
21 . 921 
21.412 
22.739 
23.040 
24 . 117 
25 . 503 
26.4 43 
27.810 
28.772 
28 . 514 
29 . 990 
21.763 
21.548 
20. 907 
19. 648 
18. 915 
18.:778 
18 .285 



-20. 635 
-20. 294 
-22. 143 
-20. 551 
-21.478 
-19. 932 
-20.305 
-19 .514 
-19.919 
-18 . 930 
-21 .312 
-20 .096 
-20 .899 
-19.014 
-18 .745 
379 
17.021 
17 .112 
15.608 
19.868 
20.400 
20 .244 
335 



-17 



-21 

-21.556 
-20. 591 
-22 . 957 
-22. 644 
-23. 382 
-22 .933 
-24 . 153 



-24 
-24 
-24 



.292 
. 571 
. 609 



-26 .250 
-24 . 149 
-25. 167 
-22 . 996 



-22 
-21 



871. 
,462 
-23. 162 
-23.879 
-22. 613 
-22 .827- 
-21 . 912 
-20.455 
-19. 522 
-19. 765 
-18 . 542 
-24 . 280 
-24 .814 
-24 . 923 
-26. 330 
-26. 789. 
-26.222 
-26. 4 67 
-26.041 
-26. 001 
-26. 361 
-25. 596 
-27 . 147 
-28. 151. 
-26. 713 
-27. 420 
-26. 800 
-27 . 333 
-28 . 339 



21.725 
21.561 
21 . 690 
23. 321 
24 . 107 
•22 . 643 
-22 .780 
-21.791 
-20 . 309 
-19.416 
-20 . 129 
-24 . 198 
-24 . 692 
-24 . 845 
-26 . 222 
-26. 690 
-28 . 173 
-28 .494 
-28 . 460 
-27 . Ill 
-27 . 953 
-26. 910 
-27 . 688 
-27 . 336 
-28 . 029 
-27.757 
-27 .457 
-28 . 398 
-26. 200 
-25.870 
-24 . 350 
-23 . 647 
-21 . 855 
-21 . 622 
-26. 519 
-27 . 029 
-26 . 504 
-27 . 098 
-26.859 
-28 . 594 
-29 . 130' 
-29 . 269 
-30 . 704 
-31 . 274 • 
-31 . 143 
-31 . 572 
-31 . 197- 
-32 . 273 
-31 .025 
-32. 006 
-30 . 200 
-30 .408 
-29 . 427 
-29.7.37: 
-28 . 570 
-28 .846 
-27 . 932 
-26.678 
-28 .269 
-30 .218 
-30.895 
-29.299 
-29.063 
-27 .882 
-30 . 317 
-30.827 



1 .00 
1 . 00 
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1 . 00 
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1 
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00 
00 
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00 
00 
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16. 62 
18 . 31 
18 . 20 
17 . 40 
16.65 . 
17 . 90 
17 . 27 
17 . 35 
16.74 
14 . 82 
18.13 
17 . 58 

16 . 08 
17.87 

17 . 39 

17 .84 

18 . 65 
16.92 
18 .47 
17.74 
17.59 

20 .12 

21 . 30 

22 . 69 
25 .80 
24 .48 
20 .40 
20 . 34 
20 . 31 
19. 91 

20 .86 

21 . 37 

22 .76 
22 . 37 
19.43 
20.85 
18 .81 

18 . 49 

17 . 16 

19 . 04 

18 .16 
19.87 
21.47 
22 . 13 
24 . 10 
26.61 
26. 92 
27 . 58 
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l v . 00 
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22. 44 
22.84 

22 . 30 
23.77 
25 .03 
26.79 
28 . 88 
29.09 
30.32 
32 .07 
31 . 36 

23 . 13 
23.. 45 
22.82 
22 .83 
22 .03 
22.86 
21 . 38 



147 



WO 01/90301 



PCT/USOl/11500 



BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BB3BATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

3B3BATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 

BBBBATOM 



5109 

5110 

5111 

5112 

5113 

5114 

5115 

5116 

5117 

5118 

5119 

5120 

5121 

5122 

5123 

5124 

5125 

5126 

5127 

5128 

5129 

5130 

5131' 

5132 

5133 

5134 

5135 

5136 

5137 

5138 

5139 

5140 

5141 

5142 

5143 

5144 

5145 

5146 

5147 

5148 

5149 

5150 

5151 

5152 

5153 

5154 

5155 

5156 

5157 

5158 

5159 

5160 

5161 

5162 

5163 

5164 

5165 

5166 

5167 

5168 

5169 

5170 

5171 

5172 

5173 

5174 



N 

CA 
CB 
CG 
CD 
NE 
CZ 
NH1 
NH2 
C 
O 
N 

CA 
CB 
C 
O 
N 

CA 
CB 
C 
O 

• N , 
CA 
CB 
OG 
C 
O 
N 

CA 
CB 
CG2 
CGI 
CD1 
C 
O 
N 

CD 
CA 
CB 
CG 
C 
O 
N 

CA 
CB 
CG 
OD1 
OD2 



CA 
CB 



CA 
CB 
OG1 
CG2 



N 

CA 
CB 
CG 



ARG 
ARG 
ARG 
ARG 
ARG 
ARG 
ARG 
ARG 
ARG 
ARG 
ARG 
ALA 
ALA 
ALA 
ALA 
ALA 
ALA 
ALA 
ALA 
ALA 
ALA 
SER 
SER 
SER 
SER 
SER 
SER 
ILE 
ILE 
ILE 
ILE 
ILE 
ILE 
ILE 
ILE 
PRO 
PRO 
PRO 
PRO 
PRO 
PRO 
PRO 
ASP 
ASP 
ASP 
ASP 
ASP 
ASP 
ASP 
ASP 
ALA 
ALA 
ALA 
ALA 
ALA 
THR 
THR 
THR 
THR 
THR 
THR 
THR 
GLU 
GLU 
GLU 
GLU 



B 335 
B 335 
B 335 
B 335 
B 335 
B 335 
B 335 
B 335 
B 335 
B 335 
B 335 
B 336 
B 336 
B 336 
B 336 
B 336 
B 337 
B 337 
B 337 
B 337 
B 337 
B 338 
B 338 
B 338 
B 338 
B 338 
B 338 
B 339 
B 339 
B 339 
B 339 
B 339 
B 339 
B 339 
B 339 
B 340 
B 340 
B 340 
B 340 
B 340 
B 340 
B 340 
B 341 
B 341 
B 341 
B 341 
B 341 
B 341 
B 341 
B 341 
B 342 
B 342 
B 342 
B 342 
B 342 
B 343 
B 343 
B 343 
B 343 
B 343 
B 343 
B 343 
B 344 
B 344 
B 344 
B 344 



18.602 - 
17.795 - 
17.815 - 
16.804 - 
15.381 - 
14 . 435 - 
13.607 - 
13.601 - 
12.791 - 
18.348 - 
17.595 - 
19.671 - 
20.330 - 
21 . 830 - 
20.101 - 
20.052 - 
19.959 - 
19.740 - 
20.205 - 
18.267 - 
17.887 - 
17.442 - 
16.008 - 
15.286 • 
13.921 • 
15. 593 
16.274 
14 . 466 
13.882 
13. 989 
13.271 
15 . 463 
15. 688 
12 . 416 
11 . 601 
1 2 . 0.6 8 
12 . 999 
10.733 
10 . 994 
12.052 
9.551 
8 . 410 
9. 803 
8.711 
8 . 802 

10. 101 

11 . 125 
10 . 102 

8 . 655 
. 8.377 
8 . 908 
8 .875 
9. 174 
7 . 530 
7 . 475 
6.441 
, 115 
. 998 
.119 
. 628 
. 934 
. 533 
.234 
5.085 
5.414 
4 .412 



5 . 

3. 

4 . 

2 

4 

4 

5. 



26.114 
25.892 
24 .407 
23.995 
24 . 129 
•23. 414 
•22 .469 
-22 . 123 
-21.862 
•26.724 
-27 . 360 
-26. 737 
-27 . 477 
-27 . 174 
-28 . 985 
-29. 665 
-29. 501 
-30 . 925 
-31 . 320 
-31 . 276 
-32.445 
-30.254 
-30. 432 
-29.140 
-29. 196 
-30.822 
-30.503 
-31 . 523 
-31 . 941 
-33.472 
-33 . 880 
-33 . 887 
-35.378 
-31 .518 
-32 . 229 
-30.344 
-29. 525 
-29.730 
-28 . 349 
-28 . 642 
-30 .409 
-30. 222 
-31 . 180 
-31 . 820 
-31 .448 
-31 . 886 
-31 . 907 
-32 . 200 
-33. 336 
-34 .066 
-33. 807 
-35.238 
-35 .496 
-35.844 
-36. 901 
-35. 177 
-35. 696 
-34 .765 
-34 . 642 
-35.330 
-35.882 
-36. 952 
-34 .843 
-34 .933 
-33.589 
-32 .483 



•30.815 
•32 .002 
•32 . 361 
•33.413 
♦32 . 891 
•33.736 
-33 . 301 
-32 .016 
-34 . 157 
-33. 163 
-33 . 902 
-33 .297 
-34 . 372 
- 34 . 374 
-34 . 288 
-35. 303 
-33.072 
-32.865 
-31 ..467 
-33 . 047 
-33 .029 
-33.229 
-33. 408 
-33.029 
-33 . 385 
-34 . 829 
-35.801 
-34 . 921 
-36. 187 
-36. 384 
-37 . 658 
-36. 450 
-36. 667 
-36. 059 
-35. 479 
-36.611 
-37 .409 
-36. 600 
-37 . 194 
-38 . 205 
-37 .234 
-36. 861 
-38 . 334 
-39.056 
-40.536 
-41.159 
-40.443 
-42 . 367 
-38 .899 
-39.854 
-37 . 683 
-37 .411 
-35.927 
-37 .792 
-38 .416 
-37 .427 
-37 .744 
-37 .205 
-35 .782 
-37 .528 
-39.254 
-39.711 
-40.030 
-41.480 
-42.142 
-41.814 



00 23. 14 
00 23.54 
00 25. 17 
00 26.90 
00 30.87 
00 33.71 
00 35. 57 
00 34.54 
00 35.17 
00 23.86 
1 . 00 22 . 38 
1 . 00 25.1"? 
1.00 26.85 
1 . 00 26.27 
00 27 .80 
00 27 . 33 
00 29.45 
00 30.89 
00 31.47 
00 32.47 
00 34 . 14 
00 31 .76 
.00 32 . 4 1 
.00 31 . 94 
.00 34 . 53 
.00 33 . 16 
.00 32.25 



00 33.44 
00 34 . 35 
00 35. 51 
00 35 . 53 
00 36.09 
1.00 37.7 1 
1 . 00 34 . 84 
1 . 00 35 . 09 
1 . 00 34 . 93 
1.00 35.06 
1.00 34.94 
1.00 35.58 
1 .00 36.27 
1.00 34.07 
1 . 00 34.18 
1 . 00 33.42 
1.00 33.33 
1.00 35.03 
1.00 37 .02 
1.00 37.17 
1.00 40.57 



00 

.00 
.00 



32 . 11 
30.74 
30 . 63 
.00 29.09 
.00 28 . 23 
.00 28.32 
1 . 00 29.88 
1 . 00 28 . 28 



.00 28.55 
,00 29.04 
.00 28.04 
.00 28.54 
.00 29.50 
.00 28.08 
.00 30.76 
.00 32.00 
.00 34.02 
-.00 38.27 
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WO 01/9030 1 - 



PCT/USO 1/1 1500 



BBBBATOM 


5175' 


CD 


GLU 


B 


344 


BBBBATOM 


5176 


OE1 


GLU 


B 


344 


BBBBATOM 


5177 


OE2 


GLU 


B 


344 


BBBBATOM 


5178- 


C. 


GLU 


B 


344 


BBBBATOM 


5179 


O 


GLU 


B 


344 


BBBBATOM 


5180 


N ■ 


ARG 


B 


345 


BBBBATOM 


5181' 


CA 


ARG 


B 


345 


BBBBATOM 


5182 


CB 


ARG 


B 


345 


BBBBATOM 


5183 


CG 


ARG 


B 


345 


BBBBATOM 


5184 


CO 


ARG 


B 


345 


BBBBATOM 


5185 


NE 


ARG 


B 


345 


BBBBATOM ' 


5186 


CZ 


ARG 


B 


345 


BBBBATOM 


5187 


NH1 


ARG 


B 


345 


BBBBATOM 


5188 


NH2 


ARG 


B 


345 


BBBBATOM 


5189 


C 


ARG 


B 


345 


BBBBATOM 


5190 


O 


ARG 


B 


345 


BBBBATOM 


5191 


N 


VAL 


B 


346 


BBBBATOM 


5192 


CA 


VAL 


B 


346 


BBBBATOM 


5193 


CB 


VAL 


B 


346 


BBBBATOM 


5194 


CGI 


VAL 


B 


346 


"BBBBATOM 


5195 


CG2 


VAL 


B 


346 



BBBBATOM 5196 . C # VAL B 34 6 

BBBBATOM 5197" o' VAL B 346 

BBBBATOM 5198 N ALA B 34 7 

BBBBATOM 5199 CA ALA B 34 7 

BBBBATOM 5200 CB ALA B 34 7 

BBBBATOM 5201 C ALA B 347 

BBBBATOM 5202 O ALA B 347 

BBBBATOM 5203 N ASM B 34 8 

BBBBATOM 5204 CA ASN B 348 

BBBBATOM 5205 CB ASN B 348 

BBBBATOM 5206 CG ASN B 348 

BBBBATOM 5207 OD1 ASN B 348 

BBBBATOM 5208 ND2 ASN B 348 

BBBBATOM 5209 C ASN B 348 

BBBBATOM 5210 O . ASN B 348 

BBBBATOM 5211 N GLU B 349 . 

BBBBATOM 5212 CA GLU B 349 

BBBBATOM 5213 CB GLU B 34 9 

BBBBATOM 5214 CG GLU B 349 

BBBBATOM 5215 CD' GLU B 34 9 

BBBBATOM 521*6 OE1 GLU B 349 

BBBBATOM 5217 OE2 GLU :B 349 

BBBBATOM 5218 C . GLU :B 349 

BBBBATOM 5219 O - GLU B 34 9 

BBBBATOM • 5220 N VAL B 350 

BBBBATOM 5221 CA VAL B 350 

BBBBATOM 5222 CB VAL B. 350 

BBBBATOM 5223 CGI VAL B 350 

BBBBATOM 5224 CG2 VAL B 350 

BBBBATOM 5225 C VAL B 350 

BBBBATOM 5226 O- VAL B 350 

. BBBBATOM 5227 N SER B 351 

BBBBATOM 5228 CA" SER B 351 

BBBBATOM 5229 CB SER B 351 

BBBBATOM 5230 OG SER B 351 

BBBBATOM 5231 C SER B 351 

BBBBATOM 5232 O SER B .351 

BBBBATOM 5233 N ARG B.352 

BBBBATOM 5234 CA ARG B 352 

BBBBATOM 5235 CB ARG B 352 

* BBBBATOM 5236 CG ARG B 352 

BBBBATOM 5237 CD ARG B- 352 

BBBBATOM 5238 NE ARG B 352 

BBBBATOM 5239" CZ ARG 8-352 

BBBBATOM 524*0' NH1 ARG B 352 



4 807 -31.661 -40.594 1.00 40.96 
5.252 -32.252 -39.582 1.00 41.33 

4 665 -30.419 -40.649 1.00 4 2.30 

5 970 -36.032 -42.056 1.00 31.14 
5.534 -36.805 -42.909 1.0032.33 

7 209 -36.106 -41.587 1.00 30.66 

8 138 -37.123 -42.067 1.00 31.44 

9 494 -36.986 -41.376 1.00 33.54 
10 293 -35.772 -41.793 1-00 37.73 
11.716 -35.880 -41.284 1.00 40.70 
12.580 -34.864 -41.873 1.00 44.36 
13.901 -34.841 -41.739 1.00 45.65 
14 514 -35.782 -41.031 1.00 46.05 
14"608 -33.881 -42.322 1.00 47.38 

7 626 -38.545 -41.854 1.00 30.89 
7 724 -39.387 -42.746 1.00 29.98 
7*086 -38.818 -40.670 1.00 29.72 
6 578 -40.151 -40.384 1.00 28.61 

6 197 -40.298 -38.885 1.00 27.90 
5 612-41.680 -38.625. 1.00 27.30 

7 419 -40.060 -38.024 1.00 27.32 
5 361 -40.435 -41.252 1.00 28.33 
5 248 -41.511 -41.847 1.00 28.30 
4 457 -39.465 -41.335 1.00 28.76 
3249 -39.617 -42.137 1.00 28.96 
2 363 -38.387 -41.994 1.00 28.49 
3 ' 596 -39.844 -43.606 1.00 30.26 
2 948 -40.641 -44.279 1.00 30.12 

4 613 -39.142 -44.098 1.00 32.77 

5 035 -39.286 -45.493 1.00 34.56 
6.045 -38.195 -45.868 1.00 35.68 

5 384-36.846 -46.123 1-00 37.21 

4 157 -36.719 -46.081 1.00 38.48 

6 19/9 -35.831 -46.393 1.00 38.38 
5633 -40.668 -45.765 1.00 34.90 

5 433 -41.233 -46.841 1.00 34.59- 
6*366 -4i.212 -44.794 1.00 35 . 20 1 

6 954 -42.540 -44.956 1.00 34.86 

8 004 -42.814 -43.879 1.00 34.59 

9 404 -42.385 -44.259 1.00 38.16 
'9 865 -42.992 -45.579 1.00 38.74 
9.806 -44.232 -45.735 ■ 1.00. 40.12 

10 292 -42.225 -46.461 1.00 39.64 
5 872 -43.607 -44.895 1.00 33.96 
5*942 -44.621-45.591 1.00 33.99 
4 875-43.377 -44.051 1.00 33.91 
3 767 -44.306 -43.919 1.00 33.79 
2 848 -43.907-42.744 1.00 33.70 

1 554 -44.695 -42.798 1.00 32.22 
3*568 -44.154 -41.425 1.00 32.24 

2 969-44.274 -45.217 l\ 00 34.78 
2.411 -45.285 -45.645 1.00 34.85 
2.925 -43.102 -45.844 1.00 35.34 
2*196 -42.946 -47.095 1.00 36.67 
2024 -41.462 -47.425 1.00 37.32 
1.241 -41.292 -48.590 1.00 37.74. 
2*958 '-43.639 -48.222 1.00 37.35 
2*365 -44.309 -49.064 1.00 36.85 
4*277 -43.482 -48.223 1.00 37.89 

5 114 -44.088 -49.251 1.00 40.03 

6 557 --43.600 -49.114 1.00 41.58 
7*470 -44.069 -50.232 lr.OO 44.46 
8*906 -44.157 -4 9 . 7 5 9 . . 1 . 00 47.36 
9*097-45.271 -48.832 1.00 49.57 

10234 -45.524-48.191.- 1.00 50.54 
11*287 -44.738 -48.372 .1.00 50.88 
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PCT/USO 1/1 1500 



BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBBATOM 
BBBB 
ATOM 
WATR 
ATOM 
WATR 
ATOM 
WATR 
ATOM 
WATR 
ATOM 
WATR 
ATOM 
WATR 
ATOM 
WATR 
ATOM 
WATR 
ATOM 
WATR 
ATOM 
WATR 
ATOM 
WATR 
ATOM 
WATR 



5241 

5242 

5243 

5244 

5245 

524 6 

5247 

524 8 

5249 

5250 

5251 

5252 

5253 

5254 

5255 

5256 

5257 

5258 

5259 

5260 

5261 

5262 

5263 

5264 

5265 

5266 

5267 

5268 

5269 

5270 

5271 

5272 

5273 

5274 

5275 

5276 

5277 

5278 

5279 

5280 



NH2 ARG B 352 



C 
O 
N 

CA 
CB 
CGI 



ARG B 352 

ARG B 352 

VAL B 353 

VAL B 353 

VAL B 353 

VAL B 353 



CG2 VAL B 353 

C VAL B 353 

O VAL B 353 

N ALA B 354 

CA ALA B 354 

CB ALA B 354 

C ALA B 354 

O ALA B 35 4 

N ARG B 355 

CA ARG B 355 

CB ARG B 355 

CG ARG B 355 

CD ARG B 355 

NE ARG B 35 5 

.CZ ARG B 35 5 

NH1 ARG B 355 

NH2 ARG B 355 
C 

o 

N 

CA 
CB 
C 
O 
N 

CA 
CB 
CG 
CD1 

CD2 LEU 
C 

OT1 



ARG B 355 
ARG B 355 
ALA B 35 6 
ALA B 35 6 
ALA B 356 
ALA B 356 
ALA B 356 
LEU B 357 
LEU B 357 
LEU B 357 
LEU B 357 
LEU B 357 
B 357 
LEU B 357 
LEU B 357 



10.321 
5.098 
5. 101 
5.096 
5.089 
5.446 
5 .279 



6. 
3 , 
3. 
2 
1 . 
0 
1 
0 



OT2 LEU B 357 



.879 
.742 
. 679 
. 664 
.336 
.264 
.250 
. 531 

1 . 998 

2 .035 
2. 658 
1 . 998 
0.716 
0. 112 

-1.054 
-1.758 
-1.519 
2.899 
2.966 
3.569 
4.453 
3.669 
5.611 
6.193 
5.942 
7 .023 
6.871 
5.595 
5. 624 
5.485 
8.391 
9.244 
8.601 



-46.566 - 

-45.613 

-46.292 • 

-46. 143 

-47.587 

-47.927 

-49.420 

-47 . 509 

-48.206 

-49.343 

-47.465 

-47.957 

-47.043 

-47 .971 

-48 . 780 

-47 . 068 

-46. 964 

-45. 633 

-44 . 372 

-44.014 

-42.802 

-42 . 296 

-42.896 

-4 1 . 186 

-48.099 

-48 . 319 

-48 , 805 

-49.913 

-51.004 

-49.417 

-50 . 178 

-48.136 

-47 . 522 

-45.997 

-45.422 

-43.903 

-45.859 

-47.895 

-48.359 

-47.710 



-47 . 373 
-49. 176 
-50. 204 
-47.955 
-47 .737 
-46.271 
-46.014 
-45. 987 
-48. 095 
-48 . 556 
-47 . 871 
-48.212 
-47 . 628 
-49.737 
-50 . 324 
-50 . 366 
-51 .824 
-52 .264 
-51 .730 
-52 .461 
-51 . 907 
-52 .299 
-53.252 
-51 .738 
-52 . 366 
-53.575 
-51 .456 
-51 . 809 
-52 . 546 
-52. 671 
-53. 442 
-52 . 526 
-53 . 289 
-53.290 
-53. 915 
-53. 820 
-55.370 
-52 .729 
-53.514 
-51 . 512 



1.00 50.24 
1.00 40.60 
1.00 39.84 
1 .00 41.33 
1.00 42.78 
1.00 42.54 
1.00 42.05 
1 . 00 41.86 
1.00 43,85 
1.00 44 .43 
1.00 45.85 
1 . 00 47.24 



00 46.79 
00 48 . 54 
.00 48.48 
.00 50.49 
.00 52.71 
1.00 53. 58 
1.00 55.08 
00 56.64 
00 57.43 
00 58.24 
00 58 . 53 
1.00 58.87 
1.00 53. 41 
00 53.50 
00 54 . 15 
00 54 . 93 
00 54 . 86 
00 55 . 63 
00 55.95 
1 .00 56.45 
1 .00 57 . 81 
1 
1 
1 
1 
1 
1 
1 



,00 58 . 04 

.00 58.29 

.00 58 . 36 

.00 58. 62 

.00 58.46 

.00 58.97 

.00 58 . 97 



5281 


OH2 


WAT 


W 


1 


-20 . 


568 


11 . 


549 


4 1 . 


653 


1 . 


00 


24 . 


11 


528 2 


OH2 


WAT 


W 


4 


-7 . 


219 


-67 . 


275 


-4 1 . 


843 


1 . 


00 


35. 


35 


5283 


OH2 


WAT 


W 


5 


20. 


119 


-17. 


520 


-22 . 


473 


1 . 


00 


21 . 


13 


5284 


OH2 


WAT 


W 


6 


18 . 


858 


-19. 


701 


-23. 


468 


1 . 


00 


18 . 


31 


5285 


OH2 


WAT 


W 


7 


2 . 


329 


-28 . 


724 


-15 . 


978 


1 . 


00 


2 7 . 


32 


5286 


OK2 


WAT 


W 


8 


9 . 


.484 


-48 . 


,435 


-27 . 


, 938 


1 . 


.00 


23 . 


67 


5287 


OH2 


WAT 


W 


9 


7 . 


. 645 


-57 . 


. 693 


-27 , 


. 177 


1 


.00 


21 . 


.03 


5288 


OH2 


WAT 


W 


10 


-1 


.542 


-8 


. 422 


1 


.824 


1 


.00 


29 


.71 


5289 


OH2 


WAT 


W 


11 


5 


.875 


-50 


.793 


-32 


. 396 


1 


. 00 


20 


. 21 


5290 


OH2 


WAT 


W 


12 


27 


.592 


-18 


. 174 


-27 


.779 


1 


.00 


22 


.52 


5291 


0H2 


WAT 


W 


13 


7 


. 842 


-13 


.432 


-21 


. 173 


1 


.00 


25 


.85 


5292 


OH2 


WAT 


W 


14 


4 


.845 


-57 


. 924 


-27 


.444 


1 


. 00 


24 


.35 
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ATOM 
WATR 
ATOM 
WATR 
ATOM 
WATR 
ATOM 
WATR 
ATOM 
WATR 
ATOM 
WATR 
ATOM 
WATR . 
ATOM 
WATR • 
ATOM 
WATR 
ATOM 
WATR ' 
ATOM 
WATR 
ATOM 
WATR 
. ATOM 
WATR 
ATOM 
WATR 
- ATOM 
WATR 
ATOM 
. WATR 
. ATOM 
WATR 
■ ATOM 
WATR 
ATOM 
'WATR 
ATOM 
WATR 
. ATOM 
WATR 
ATOM. 
WATR 
ATOM 
WATR 
ATOM 
WATR 
ATOM . 
WATR ' 
ATOM 
WATR 
ATOM 
WATR 
ATOM . 
WATR 
ATOM. 
WATR 
'ATOM 
WATR 
ATOM 
WATR 
ATOM 
WATR 
ATOM 
WATR 



5293 OH2 WAT W 15 

5294 OH2 WAT W " 16 

5295 OH2 WAT W 17 

5296 OH2 WAT W** 18 

5297 . OH2 WAT W 19 

5298 OH2 WAT W 2 0 

5299 OH2 WAT W 21 

5300 OH2 WAT W 22 

5301 OH2 WAT W 23 
5 302 OH2 WAT W 2 4 
5 30 3 OH2 WAT W 2 5 

5304 OH2 faAT W 2 6 

5305 OH2 WAT W 27 
.5306 OH2 WAT W 28 

5307 OH2 WAT W 29 

5308 OH2 WAT W ' 30 

5309 .OH2 WAT W 3*1 

5310 ..OH2 WAT "w 32 
5 311 „OH2 WAT W 3 3 
5312 ,' OH2 WAT W 34 

. ^5313 3 OH2 WAT W . 35 

5314 OH2 WAT W 36 

5315 OH2 WAT W 37 
5316. OH2 WAT W. 38 

5317 OH2 WAT W 39 

5318 OH2 WAT W 4 0 

5319 OH2 WAT W 41 
/ 5320 OH2 WAT W 42 

532 1 OH2 WAT W 43 
532 2 OH2 WAT W 4 4 
5 32 3 OH2. .WAT W 4 5 

5324 .OH2 WAT W 46 

5325 OH2 WAT W 47 



PCT/US01/U500 

0.473 -58.751 -17.746 1-00 35.49 
7.998 -52.522 -25.785 1.00 22.34 
-8.656 11.300 18.872 1.00 23.81 
8.711 -45.913 -29.121 1.00 21.55 
2.957 -68.158 -38.242 1.00 29.43 
16.486 -11.742 -16.567 1.00 22.13 
-6.251 17.702 28.534 1.00 24.24 
12.670 -47.636 -24.808 1.00 25.87 
6.513 -15.597 -22.517 1.00 26.31 
7.536 -66.906 -21.753 1.00 21.48 
-29.060 13.621 26.406 1.00 21.08 
-5.240 10.154 13.527 1.00 29.62 
29.942 -20.139 -19.237 1.00 20.38 
18.996 -28.763 -24.427 1.00 20.28 
8.755 -51.080 -27.990 1.00 20:66 
4.215-64.634 -43^328 1.00 39.67 
14.703 -11.936-1.749 1,00 24.57 
23.140 -13.870 -21.266 1.00 18.93 
4 .057 -1 .221 9.809 1 .00 32.30 
4.784 -56.759 -43.904 1.00 25'.99 
-22.733 10.283 33.238 '1.00 24*.60 
0.540 14.225 10.932 1.00*26.89 
-7.560 11.931 12.593 1.00 27.76 
-7.966 17.043 30.555 1.00 20.04 
6.716 -55.314 -42.959 1.00 25.72 
6.833 -32.402 -3.845 1. 00*32. 49 
30.445 -20.104 -25.459 1.00 27.97 
1.475 -15.304 -22.123 1.00 30.57 
15.703 -42.835 -31.237 1.00 26.74 
7.131 -6/595 -18.003 1.00 29.47 
30.256 -23.202 -11.163 1.00 33.31 
-6.107 -66.004 -38.690 1.00 30.45 
17.631 -17.241 -5.364 1.00 28.69 
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ATOM 5326 OH2 WAT W 4 8 
WATR 

ATOM 5327 OH2 WAT W 4 9 
WATR 

ATOM 5328 OH2 WAT W 50 
WATR 

ATOM 5329 OH2 WAT W 51 
WATR 

ATOM 5330 OH2 WAT W 52 
WATR 

ATOM 5 331 OH2 WAT W 5 3 
WATR 

ATOM 5332 OH2 WAT W 54 
WATR 

ATOM 5333 OH2 WAT W 55 
WATR 

ATOM 5334 OH2 WAT W 56 
WATR 

ATOM 5335 OH2 WAT W 57 
WATR 

ATOM 5336 OH2 WAT W 58 
WATR 

ATOM 5 3 37 OH'2 WAT W 5 9 
WATR 

ATOM 5338 OH2 WAT W 60 
WATR 

ATOM 5339 OH2 WAT W 61 
WATR 

ATOM 5340 OH2 WAT W 62 
WATR 

ATOM 534 1 OH2 WAT W 63 
WATR 

ATOM 534 2 OH2 WAT W 64 
WATR 

ATOM 534 3 OH2 WAT W 65 
WATR 

ATOM 5344 OH2 WAT W 66 
WATR 

ATOM 534 5 OH2 WAT W 67 
WATR 

ATOM 534 6 OH2 WAT W 68 
WATR 

ATOM 534 7 OH2 WAT W 69 
WATR 

ATOM 5 34 8 OH2 WAT W 7 0 
WATR 

ATOM 5349 OH2 WAT W 71 
WATR 



ATOM 5 3 5 0 


OH2 


WAT 


W 72 


WATR 










ATOM 5 351 


OH2 


WAT 


W 73 


WATR 










ATOM 5 352 


OH2 


WAT 


W 74 


WATRATOM 


5353 


OH2 • 


WAT 


W 


WATRATOM 


5354 


OH2 


WAT 


W 


WATRATOM 


5355 


OH2 


WAT 


W 


WATRATOM 


5356 


OH2 


WAT 


W 


WATRATOM 


5357 


OH2 


WAT 


w 


WATRATOM 


5358 


OH2 


WAT 


w 


WATRATOM 


5359 


OH2 


WAT 


w 


WATRATOM 


5360 


OH2 


WAT 


w 


WATRATOM 


5361 


OH2 


WAT 


w 


WATRATOM 


5362 


OH2 


WAT 


w 


WATRATOM 


5363 


OH2 


WAT 


w 


WATRATOM 


5364 


OH2 


WAT 


w 


WATRATOM 


5365 


OH2 


WAT 


w 



PCT/US01/11500 

13.436 -59.821 -27.855 1-00 28.71 

11.395 -15.293 2.925 1.00 29.36 

19.218 -21.377 -34.248 1.00 31.41 

11.973 -11.890 -15.788 1.00 35.59 

9.140 -8.260 -23.371 1.00 40.29 

-19.061 14.438 38.966 1.00 38.01 

-3.895 -70.510 -28.249 1.00 25.56 

20.909 -23.456 -34.213 1.00 26.74 

-8.505 11.136 10.323 1.00 30.97 

22.022 -15.529 -23.223 1.00 26.74 

13.860 -49.304 -42.490 1.00 27.91 

-9.455 -6.552 -9.720 1.00 41.70 

13.798 -49.732 -23.016 1.00 41.60 

15.881 -60.461 -31.910 1.00 48.66 

-9.797 12.718 13.997 1.00 29.14 

16.793 0.356 -6.115' 1.00 26.17 

3.173 18.778 20.793 1.00 31.13 





13. 


433 


-11. 


079 


0. 


672 


1 .00 


27.40 






3. 


118 


-0. 


813 


0. 


729 


1 . 00 


24 . 66 






-22 . 


179 


3. 


583 


26. 


97 8 


1.00 


32.28 






24 . 


433 


-30. 


481 


-1 . 


783 


1.00 


41.91 






4 . 


384 


-66. 


131 


-41 . 


203 


1.00 


34 .75 






20. 


, 398 


-7 . 


386 


-4 . 


280 


1.00 


30.85 






-2 , 


.444 


-70. 


752 


-22 . 


067 


1.00 


25.46 






-3 


. 963 


-4 . 


914 


-5. 


711 


1 . 00 


29.67 






17 


. 663 


-11. 


040 


-34 . 


488 


1 . 00 


30.2 4 






21 


.404 


-42 . 


,041 


-26. 


. 621 


1 .00 


31.41 




75 




-1 . 


110 


-15 . 


319 


-1 . 


089 


1 .00 31 . 


51 


76 




0 . 


688 


19. 


730 


22 . 


519 


1 .00 26. 


71 


77 




12 . 


, 113 


-69. 


335 


-26. 


593 


1 . 00 27 . 


11 


78 




11 . 


.725 


-25. 


,065 


-33 . 


817 


1 .00 43 . 


79 


79 




-25 


. 519 


9. 


, 450 


22 . 


092 


1 .00 28 . 


18 


80 




-14 


. 673 


6 


. 584 


16. 


,023 


1 .00 31 . 


37 


81 




-2 


.250 


-0 


.253 


-1 , 


.741 


1.00 29. 


. 99 


82 




-7 


. 300 


12 


. 943 


8 


.415 


1.00 32, 


. 39 


83 




1 


.712 


-13 


. 629 


-13 


. 904 


1 .00 34 


. 08 


84 




4 


.709 


-17 


. 478 


-6 


. 557 


T.OO 2 9 


. 67. 


85 




10 


.070 


-57 


.496 


-44 


.450 


1 .00 4 8 


. 39 


86 




8 


.040 


-30 


.281 


-10 


.117 


1.00 28 


. 59 


87 




-1 


.967 


-32 


.372 


-38 


.643 


1.00 31 


. 55 
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WATRATOM 


5366~ 


OH2 


WAT 


W 


88 


WATRATOM 


5367 


OH2 


WAT 


W 


89 


WATRATOM 


5368 


OH2 


WAT 


W 


90 


WATRATOM 


5369 


OH2 


WAT 


w 


91 


WATRATOM 


5370 . 


OH2 


WAT 


w • 


92 


WATRATOM 


5371 


OH2 


WAT 


w 


93 


WATRATOM 


5372 


OH2 


WAT. 


w 


94 


WATRATOM 


5373 


OH2 


WAT 


w 


95 


WATRATOM 


5374 , 


OH2 


WAT 


w 


96 


WATRATOM 


5375 


OH2 


WAT 


w 


97 


WATRATOM 


5376 


OH2 


WAT 


w 


98 


WATRATOM 


5377 


OH2 


WAT 


w 


99 


WATRATOM 


5378 


OH2 


WAT 


w 


100 


WATRATOM 


5379 


OH2 


WAT 


w 


101 


WATRATOM 


5380 


OH2 


WAT 


w 


102 


WATRATOM 


5381 


OH2 


WAT 


w 


103 


WATRATOM 


5382 


OH2 


WAT 


w 


104 


WATRATOM 


5383 


OH2 


WAT 


w 


105 


WATRATOM 


5384 


OH2 


WAT 


w 


106 


WAT RATOM 


5385 


OH2 


WAT 


w 


107 


WATRATOM 


5386 


OH2 


WAT 


w 


108 


WATRATOM 


5387 


OH2 


WAT 


w 


109 


WATRATOM 


5388' 


' OH2 


WAT 


w 


110 


WAT RATOM 


5389 


OH2 


WAT 


w 


111 


WATRATOM 


5390 


OH2 


WAT 


w 


112 


WATRATOM 


5391 


OH2 


WAT 


w 


113 



WATRATOM 


5392 


OH2 


WAT 


W * 


114 


WATRATOM 


5393 


OH2 


WAT 


w 


115 


WATRATOM 


5394 


OH2 


WAT 


w 


116 


WATRATOM 


5395 


OH2 


WAT 


w 


117 


WATRATOM 


5396 


OH2 


WAT 


w 


118 


WATRATOM 


5397 


OH 2 


WAT 


w 


119 


WATRATOM 


5398 


OH2 


WAT 


w 


120 


WATRATOM 


5399 


OH2 


WAT 


w 


121 


WATRATOM 


5400 


OH2 


WAT 


w 


122 


WATRATOM 


5401 


OH2 


WAT 


w 


123 


WATRATOM 


5402 


OH2 


WAT 


w 


124 ■ 


WAlT RATOM 


5403 


OH2 


WAT 


w 


125 


WATRATOM 


5404 


OH2 


WAT 


w 


126 


WATRATOM 


5405 


OH2 


WAT 


w 


127 


WATRATOM " 


5406 " 


' OH2 


WAT 


w 


128 


WATRATOM 


5407 


OH2 


WAT 


w 


129 


WATRATOM 


5408 ' 


OH2 


WAT 


w 


130 


WATRATOM 


5409 


OH2 


WAT 


w 


131 


WATRATOM 


5410 


OH2 


WAT 


w 


132 


WATRATOM 


5411 


OH 2 


WAT 


w 


133 


WATRATOM 


5412 


' OH2 


WAT 


w 


134 


WATRATOM 


5413 


OH2 


WAT 


w 


135 


WATRATOM 


5414 


OH2 


WAT 


w 


136 


WATRATOM 


5415 


OH2 


WAT 


w 


137 


WATRATOM 


5416 


OH2 


WAT 


w 


138 


WATRATOM ■ 


54 17 


OH2 


WAT 


w 


139 


WATRATOM 


5418 


OH2 


WAT 


w 


140 


WATRATOM 


5419 


OH2 


WAT 


w 


141 


WATRATOM 


5420 


OH2 


WAT 


' w 


142 


WATRATOM . 


.54 2 1 


OH2 


WAT 


w 


143 



WATRATOM- 54 22 OH 2 WAT W 14 4 

WATRATOM 54 2 3 OH 2 WAT W 14 5 

WATRATOM 54 24 OH2 WAT W 14 6 

WATRATOM 5425" OH2 WAT W 147 

WATRATOM 54 26 OH2 WAT W 148 

WATRATOM 5427 OH2 WAT ■ W 149 

WATRATOM 5428 OH2 WAT W 150 

WATRATOM ." 54 29 OH2 WAT W 151 

WATRATOM ■ ' 5430 OH2 WAT W 152 

WATRATOM 54 31 OH2 WAT W 153 




PCTYUSO 1/11500- 



-3.178 -64.576 -25.506 .1.00 33.66 
15.762 -9.860 -10.400 1.00 34.29 
5.654 -30.990 -27.758 1.00 35.84 
14.959 -46.096 -42.270 1.00 35.21 
10.137 -13.308 -9.753 1.00.32.65 
-4 .480 6.624 6.614 1.00 27.37 
-14 574 22.522 18.870 1.00 49.48 
-11.031 -42.768 -39.637. 1.00 37.63 

9 906 -35.479 -19.203 1.00 29.59 
-0.990 -0.781 36.951 1.00 37.27 

-11 422 9.059 11.252 1.00 30.43 
8 118 -36.710 -22.371 1.00 34.85 
12 414 -67.326 -22.791 1-00 31.24 
28 541 -24.603 -31.049 1.00 40.14 
16.276 -10.934 -3.673 1.00 33. SO 
30 979 -13.264 -22.953 1.00 24.73 
12 759 -31.636 -31.838 1.00*26.40 
23.507 -29.661 -32.187 1.00 29.62 
21 292 -13.141 -24.874 1.00 37.83 

10 171 -32.960 -15.580 1.00 26.59 
-2 207 2.376 ■ 2.034 1.00 27.55 
-6 984 22.583 16.082 1.00 39.01 
14 308 -11.038 -15.406 1.00 26.48 
10 612 9.749 24.161 1.00 40.18 

2 406 -15.823 -10.196 1.00 27.61 
6 310 -68.984 -20.661 1.00 40.25 
14 379 -10.930 -8.565 1.00 35.86 
24.183 -35.475 -30.338 1.00 35.1.7 
21 897 -31.717 -1.243 1.00 47.61 
24.065 -17.545 -13.707 1.00 32.24 

16 772 -51.926 -25.940 1.00 38.66 
-5 862 6.629 4.446 1.00 39.57 

14 133 -57.303 -28.159 1.00 30.19 
-16 538 -6.724 21.638 1.00 40.50 

19 669 -18.487 -33.216 1.00 38.38 

15 481 1-078 -4.048 1.00 32. 29; 
20.395 -13.033 -2.072 1.00 49.69 
15 526 -1.437 -15.842 1.00-36.06 

7 297 -29.419 -1.509 1.00 34.75 
9 994 -12.069 -21.013 1.00 32.21 

17 433 -42.825 -16.713 1.00 31.02 
-15.855 20.882 21.019 1.00 44.11 

-6 351 -7.687 -14.067 1/00 35.04 
7 954 -17.872 -1.475 1.00 28.45 
13.526 -34.593 -31.844 1:00 31.12 
9 992 -41.228 -23.098 1.00 26.45 
8.434 18.132 .16.019 1:00 32.97 
-1 208 -33.658 -36.216 1.00 38.33 
-14 502 9.100 12.433 1.00 43.21 
14 394-43.675 -17.325 1.00 32.32 
-4 809 -30.333 -46.416 lVOO 42.65 
18.861 -35.072 --35.671 1.00 43.56 
-10 162 -60.139 -32.862 1.00 35.41 
6" 740 -32.411 -35.303- 1/00 33.57 
-12 257 -60.854-39.307 1.00 32.90 
18 910 -40.984 -13.084 1.00 43.43 
18 857 -49. 375" -28 . 645 1.00 31.34 
0 235 -17.424 -16.608 I'.OO 38.85 
14 236 -11.252 -24.086 ' 1.00 27.79 
31 513 -22.336-22.128 1:00 43.18 
-5 314 -70.396 -26.090 '1.00 43.91 
-7.717 -64.969 -36.308 1.00 26.30 
• 22 . 584 -1,2. 594 "' -4.179 1.00 46.91 
-12 388 ■ 9.493 36.619 '1.00 32.82 
-14 517 '16.479 .37.760' 1.00 39.52 
-10 095 -34. 64'7 -29.068 • 1.00 41.08 
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PCT/USOl/11500 



WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 



5432 
5433 
5434 
5435 
5436 
5437 
5438 
5439 
5440 
544 1 
5442 
5443 
5444 
5445 
544 6 
5447 
54 48 
5449 
5450 
5451 
5452 
5453 
5454 " 
5455 
5456 
5457 
5458 
5459 
5460 
5461 
5462 
5463 
5464 
5465 
5466 
5467 
5468 
54 69 
5470 
5471 
5472 
5473 
5474 
5475 
5476 
5477 
5478 
5479 
5480 
5431 
5482 
5483 
5484 
5485 
5486 
5 4 87 
5488 
5489 
5490 
5491 
5492 
5493 
5494 
5495 
5496 
5497 



OH2 WAT W 154 
OH2 WAT W 155 
OH2 WAT W 156 
OH2 WAT W 157 
OH2 WAT W 158 
OH2 WAT W 159 
OH2 WAT W 160 
OH2 WAT W 161 
OH2 WAT W 162 
OH2 WAT W 163 
OH2 WAT W 164 
OH2 WAT W 165 
OH2 WAT W 166 
OH2 WAT W 167 
OH2 WAT W 168 
OH2 WAT W 169 
OH2 WAT W 170 
OH2 WAT W 171 
OH2 WAT W 17 2 
OH2 WAT W 173 
OH2 WAT W 17 4 
OH2 WAT W 17 5 
Grt2 WAT W 17 6 
0H2 WAT W 177 
OH2 WAT W 178 
OH2 WAT W 17 9 
OH2 WAT W 180 
OH2 WAT W 181 
OH2 WAT W 182 
OH2 WAT W 18 3 
OH2 WAT W 184 
OH2 WAT W 18 5 
OH2 WAT W 186 
OH2 WAT W 187 
OH2 WAT W 188 
OH2 WAT W 189 
OH2 WAT W 190 



OH2 WAT W 191 
OH2 WAT W 192 
OH2 WAT W 193 
OH2 WAT W 194 
OH2 WAT W 195 
OH2 WAT W 196 
OH2 WAT W 197 
OH2 WAT W 198 
OH2 WAT W 199 
OH2 WAT W 200 
OH2 WAT W 201 
OH2 WAT W 202 
OH2 WAT W 203 
OH2 WAT W 204 
OH2 WAT W 205 
OH2 WAT W 206 
0H2 WAT W 207 
OH2 WAT W 208 
OH2 WAT W 209 
OH2 WAT W 210 
OH2 WAT W 211 
OH2 WAT W 212 
OH2 WAT W 213 
OH2 WAT W 214 
OH2 WAT W 215 
OH2 WAT W 216 
OH2 WAT W 217 
OH2 WAT W 218 
0H2 WAT W 219 



-5.233 
-6. 322 
10.262 
22.929 - 
-15.987 
13.385 - 
26.508 - 
4.671 - 
-12.589 

13.899 - 
-31 .053 
9.797 - 
0.877 - 
-17 .088 

0.855 - 
-14 .873 
11 .913 - 
25.783 • 

7 . 169 
20.074 

8 .765 
-4 . 953 
11 .889 

2.149 
-14 .051 
8 .488 
-1.152 
-7 .002 
-12.320 
6.790 
26.305 
20 . 402 
8.061 
-7.549 
-31 .099 
28 .566 
-5. 613 
17 .024 
-22 .114 
10.204 
27 .893 
-5.582 
23.004 
5 . 189 
-7.740 
-8.156 
23.286 
30.646 
-8.238 
19.370 
0.885 
16.084 
-0. 698 
0. 682 

1 . 646 
-21.611 

-5.143 
-9. 645 
22.096 
-5.759 
-4 . 323 
-5. 450 

2 . 330 
-26.827 

10 .887 
-9.020 



-4 . 134 
11.278 
-9.572 - 
10.414 - 

3. 994 
44.923 - 
13.616 - 
66.907 - 
12 . 262 
•62.269 - 
15.663 
-47.899 - 
-51.774 - 

16.246 
-52.086 - 

18 . 295 
-62.134 - 
-23.984 • 
-50.047 
-42.845 
5 . 909 
-64 . 494 
-61.263 
-49. 169 
6.399 
-46.760 
-23.348 
3.531 
-54 .772 
-54 .559 
-38 .240 
-58 . 179 
-31 . 341 
-15.619 
11 . 941 
-25.441 
-40. 109 
-13.428 
10. 176 
-29. 330 
-25.793 
17 . 681 
-45. 224 
-58.857 
56. 165 



31 . 160 
-1.883 * 
16.736 
23.566 
16.559 
46.826 
18 .049 
■17 .861 
11.825 
•25. 144 
19.272 
-25.140 
-25. 619 
37 . 180 
-22 .078 
21 . 203 
-35. 641 
-33. 162 
-23.737 
-14 . 939 
9. 193 
-45.351 
-22.531 
-24 .836 
13.353 
-23. 118 
-11 . 975 
7.051 
-29 . 990 
-47 .733 
-19.177 
-34 . 391 
-19.653 
-5. 482 
25. 471 
-15. 103 
-50. 158 
1.709 



24 .723 
-32. 333 
-14 . 180 

-4 . 609 
0.814 
-27 . 619 
-56. 649 
-19. 360 
-14 . 985 
' 17 .427 

1 . 533 
-55. 137 

13.045 
-11.242 

2 . 610 
7 .731 
7 . 197 

-32 . 635 
16.219 

-51 .427 
19.746 



37 . 673 
-20 . 066 
-21 .862 
32 .898 
-27.870 
-25.016 
-24 .052 
26. 733 
-33. 400 
-20.528 
29.299 
-7 .532 
-1.442 
-26. 382 
-9.869 
-15.794 
31 . 991 
20. 359 
-45.825 
37 . 660 
-30.224 
-15.934 
-7 . 331 
10 . 632 
-34 . 500 
13.451 
-23.191 
-3 . 698 



1.00 35.31 
1 . 00 35 . 75 
1 . 00 42.40 
1.00 36.66 
1 . 00 37 . 22 
1 . 00 41.55 
1.00 25.93 
1 . 00 31 . 54 
1.00 32.71 
1.00 30.71 
1 . 00 30 . 19 
1.00 26.79 
1.00 30.02 
1.00 25.63 
1 . 00 40.99 
1 . 00 40.28 
1 . 00 41.33 
1 . 00 44.03 
1.00 47 .85 
1 . 00. 32 .87 
1 . 00 34 . 30 
1 . 00 47.11 
1.00 36.63 
1.00 39.21 
1 .00 39.89 
1 . 00 45.24 
1 . 00 30. 36 
1 . 00 44.50 
1.00 38.61 
1 . 00 44.05 
1.00 39.53 
1 .00 46.11 
1.00 41.37 
1.00 40 . 60 
1.00 38.80 
1 . 00 34.75 
1.00 49.21 
1 . 00 39. 93 
1 . 00 32 . 53 
1 . 00 27 . 24 
1.00 38 . 97 
1.00 43. 64 
1.00 43.04 
1 . 00 28 . 15 
1.00 35.98 
1 . 00 45.54 
1.00 38 . 12 
1 . 00 31-01 
1 . 00 39 . 76 
1 .00 35.83 
1\00 47.04 
1.00 4 5.97 
1 . 00 37 . 53 
1.00 33.35 
1.00 40.39 
1.00 31.04 
1.00 30.17 
1.00 42.93 
1 . 00 48.12 
1 .00 4 1 . 67 
1.00 37.92 
1 . 00 33.99 
1.00 38.35 
1 . 00 38 . 12 
1 . 00 39. 81 
1 .00 46. 58 
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WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
. WATRATOM 
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WATRATOM - 
• WATRATOM 
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WATRATOM 
WATRATOM 
WATRATOM 
WATRATOM 
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5498 
5499 
5500 
5501 
5502 
5503 
5504 
5505 
5506 
5507 
5508 
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5510 
5511 
5512 
5513 
5514 
5515 
5516 
5517 
5518 
5519 
5520 
5521 
5522 
5523 
5524 
5525 
5526 
5527 
5528 
5529 
5530 
' 5531 
5 5 32 
5533 
5534 
5535 
5536 
5537 
5538 
. 5539, 
5540 
5541 
5542 
5543 
. 5544 
5545 
5546 
5547 
5548 
5549 
5550 
5551 
5552 
5553 



OH2 WAT W 220 
OH2 WAT W 221 
OH2 WAT W 222 
OH2 WAT W 223 
OH2 WAT W 22 4 
OH2 WAT W 225 
OH2 WAT W 226 
OH2 WAT' W 227 
OH2 WAT W 228 
OH2 WAT W 229 
OH2 WAT W 230 
OH2 WAT W 231 
OH2 WAT W 2 32 
OH2 WAT W 233 
OH2 WAT W 234 
OH2 WAT W 235 
OH2 WAT W 236 
OH2 WAT W 2 37 
OH2 WAT W 238 
OH2 WAT W 239 
OH2 WAT W 24 0 
OH2 WAT W 241 
' OH2 WAT W 24 2 
OH2 WAT W 24 3 
OH2 WAT W 24 4 
OH2 WAT W 2 45 
OH2 WAT W 24 6 
OH2 WAT W 24 7 
OH2 WAT W 248 
OH2 WAT W 24 9 
OH2 WAT W 250 
OH2 WAT W 251 
OH2- WAT W 252 



5554 

5555 

5556 

5557 

5558 

5559 

5560 

5561 

5562 

5563 



OH2 WAT W 253 
OH2 WAT W 254 
OH2 WAT W 2 55 
OH2 WAT W 256 
OH2 WAT W 257 
OH2 WAT W 258 
OH2 WAT W .259 
OH2 WAT W 260 
OH2 WAT W 261 
-OH2- WAT W 262 
OH2 WAT W 263 
OH2 WAT W 264 
OH2 WAT W 265 
OH2 WAT W 266 
• OH2 WAT W 267 
: OH2 WAT W 268 
OH2 WAT W 269 
OH2 WAT W 270 
OH2 WAT W 271 
OH2 WAT W 272 
OH2 WAT W 27 3 
OH2 WAT W 27 4 
OH2 WAT W 275 
OH2 WAT W -276 
OH2 WAT" W 277 

• OH2 WAT W 278 
OH 2 WAT W 27 9 
OH2 WAT W 280 
OH2 WAT W- 281 

• OH2 WAT W 282 
OH2 WAT W 283 

' OH2 WAT W 28 4 
OH2~WAT W 285 



9.054 
4 . 173 
0.983 
2.222 
13.627 
12.533 
1 .404 
31. 159 
-13.047 
8.956 
28.749 
-4 .461 
-9.795 
10. 673 
-15. 694 
3. 345 
-6.256 
28. 344 
2.876 
15.355 
27.066 
2.222 
15.052 
10.351 
-13. 173 
-7.569 
-2. 167 
7. 166 
-11.231 
5.684 
-4.745 
-0.505 
16. 668 
• 2.454 
-8.367 
15.-642 
13. 660 
11.567 
18 . 941 
-11.441 
28.664 
■ 6.795 
7.077 
-2.259 
21.812 
-27.570 
13.573 
16.549 
-9.142 
5.648 
3 . 619 
-8.129 
-17 . 342 
8 .798 
9. 190 
13.545 
-7.844 
0.478 
24 . 658 
-4 .675 
0.382 
5.91-9 

-2 .987 
8 .990 
0.155. 
-10 .443 



6. 622 
-7.985 - 
16. 806 
16.848 
1.072 
14 .212 
-7 .852 
•24 .354 
•60.728 
-37 . 681 
-13.637 
19.451 
-66.504 . 
-41 .619 
1 . 684 
1 .229 
-68 .913 
-21 .326 
-34 .368 
-11 .202 
-22.336 
18 .464 
-9.829 
-67.649 
14 .269 
9. 658 
-47.395 

2. 400 
-10.901 
-16.094 

3. 667 
-22. 136 
-37 . 987 
-18.256 
-39. 960 

7.805 
-24.331 
-6.104 
-16. 698 
-63.514 
-39. 605 
-6.961 
-14 . 349 
-48.991 
-44 .128 
4 . 389 
-27 . 185 
8.451 
9. 107 
-11.797 
-14 . 850 
-11.098 
. 8. 563 
-36. 348 
-10. 509 
-13.441 
0.944 
-4 7.721 
-18.359 
21.561 
20 . 486 
18 .010 
-63.751 
-33. 134 
-61 .872 
-56. 965 



-0 .709 
23.786 
3.910 
-6.783 
15.114 
-9.007 
5. 396 
-31. 143 
-42.282 
-16.765 
-16. 860 
8 . 684 
-35.701 
-20. 678 
32.613 
9. 738 
-30 .401 
-30. 399 
-17 . 344 
2 . 371 
-6. 437 



26.994 
-31.019 
-21. 184 
38 . 605 
0.793 
-19. 605 
15.830 
-10.057 
-26.796 
-18 . 932 
-9.079 
-7 .767 - 
-26. 130 
-21 . 638 ■ 
• 9.633 
1. 932 
-23.359 
0.528 
-39. 126 
-22.853- 
' 31.114 • 
-24 .858 
-29.099 
-35 . 641 
13.296 
0.220 
-13.582 
36.872 
-24 .893 
-23. 652 
-16.064 
9. 97 9 
-46. 119 
-35.865 



3 
-2 
-55 



898 
.560 
. 170 



-11 . 005 
12 . 155 
4 . 930- 
25. 033 
-22. 983 
-17 .898 
-48 . 384 
-24 . 681 



1 . 00 34.14 
1.00 32.75 
1.00" 40.83 
1 . 00 33 . 50 
1.00 37.51 
1.00 38.40 
1.00 38.55 
1.00 37.67 
1 .00 42 . 18 
1 . 00 41.45 
1.00 42.34 
1.00 36. 17 
1.0044.07 
1.00 36.58 
1.00 44.04 
1.00 35.70 
1.00 36.72 
1.00 36.45 
1.00 42.48 
1.00" 38 .84 
1.00 37.37 
1.00 35.75 
1.00 44.31 
1.00 35.79 
1 . 00' 41.50 
1 .00 37 . 62 
1 . 00 45 . 90 
1 . 00 42 . 90 
1.00 45.28 
1 . 00 44.76 
1.00 4-6.20 
1.00 42 .89 
- 1 .00 -35.76 
1 .00- 43. 33 
: 1.00 43.07 
1 . 00 47.78 
1.00 42.50 
1.00 37.10 
1.00 40.97 
1 .00 43. 17 
1 . 00 42 . 65 
1.00 "38.28 
1.00 41.00 
1 .00 34 . 96 
1 .-00 44.51 
1 . 00- 48 . 63 
1 . 00 43.56 
1 . 00' 44.84 
1.00 37.66 
1 . 00 45.79 
1\ 00- 34 .09 
1 . 00 39 . 37 
1.00 4 6.38 
- 1.00 37 .71 
1.00 45.80 
1.00 4 2.83 
1.00 46.27 
1.00 46.25 
1.00 36.33 
1.00 37.17 
1-.00 4 1.40 
1.00 4 1.72 
1.00 4 3.76 
1 ; 00 40 . 17 
1.00 4 9.87 
1 . 00 43.02 
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ATOM 557 7 S 


S04 


S 






SQ4 












ATOM 5578 01 


S04 


S 






S04 












ATOM 5579 02 


S04 


s 






S04 












ATOM 5580 03 


S04 


s 






S04 




£o4 








ATOM 558 1 04 


s 






S04 












TEREND 













18 . 915 
-16. 181 
7 . 197 
31.934 
-15.232 
9. 450 
-1 . 800 
-7.766 
7 . 973 
23.449 
-3. 537 
28 . 052 
20 . 655 



-33.048 
11 .706 
7 . 180 
-26. 155 
-0.248 
-27 . 963 
13. 139 
5.988 
4 .338 
-40.563 
-28 .260 
-32.620 
-43 . 315 



-3 . 930 
12 . 277 
10 . 953 
-26 . 053 
11.315 



1 . 00 
1 . 00 



-1 

-9 
9. 
14 



. 396 
. 983 
.798 
. 321 



-27 . 347 
-15 . 925 
-12 . 168 
-28 .829 



00 
00 
00 
00 
00 
00 
00 
1.00 
1. 00 
1 . 00 
1 . 00 



37 . 81 
41.77 
46.19 

38 .77 
40.14 
41.29 

41 . 60 
40.11 

39 . 97 

40 .59 

42 . 10 
48 .03 
40 . 17 



1.273 -70.953 -23.009 1.00 22.99 

1.720 -71.882 -24.053 1.00 21.18 

0.908 -69.659 -23.626 1 . 00 22.47 

2.337 -70.752 -22.018 1.00 23.88 

0.088 -71.522 -22.328 1.00 22.50 
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TABLE 2 ATOMIC COORD FN ATES OF E.COLI MURG C- ALPHA 



BACKBONE ATOMS 



ATOM 

ATOM 

ATOM • 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM. 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 



2649 
2651 
2662 
2670 
2678 
2685 
2693 
2698 
2702 
2706 
2713 
2717 
2721 
2731 
2738 
2750 
2756 
2760 
2768 
2773 
2780 
2785 
2795 
2805 
2813 
2821 
2826 
2835 
2839 
2853 
2862 
2869 
2880 
2894 
2902 
2906 
2913 
2918 
2926 
2 937 
2945 
2954 
2959 
2967 
2975 
2983 
2989 
2998 
3008 
3012 
3020 
3029 
3037 
3045 
3056 
3064 
3075 
3083 
3089 
3093 
3101 



CA. 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
" CA 
CA 
CA\ 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA . 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA. 
CA 
CA 
CA 
CA 
CA 
CA 



LYS B 
ARG B 
LEU B 
MET B 
VAL B 
MET B 
ALA B 
GLY B 
GLY B 
THR B 
GLY B 
GLY B 
HIS B 
VAL B 
P.HE B 
PRO B 
GLY .B 
. L£U B 
ALA B 
VAL B 
ALA B 
HIS B 
HIS B 
LEU B 
MET B 
ALA B 
GLN B 
GLY B 
TRP B 
GLN B 
VAL B 
ARG B 
TRP B 
LEU B 
GLY B 
THR B 
ALA B 
ASP B 
ARG B 
MET B 
GLU B 
ALA B 
ASP B 
LEU B 
VAL B 
PRO B 
LYS B 
HIS B 
GLY B 
ILE B 
GLU B 
ILE B 
ASP B 
PHE B 
ILE B 
ARG B 
ILE B 
SER B 
GLY B 
LEU B 
ARG B 



7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 
26 
27 
28 
29 
30 
31 
32 
33 
34 
35 
36 
37 
38 
39 
40 
41 
42 
43 
44 
45 
46 
47 
48 
49 
50 
51 
52 
53 
54 . 
55 
56 
57 
58 
59 
60 
61 
62 
63 
64 
65 
66 
67 



-6.512 -45.403 -47.519 
-6.682 -47.303 -44.240 
-4.094 -47.039 -41.477 
-4.048 -49.055 -38.275 
-1.982 -47.605 -35.449 
-0.523 -49.707 -32.613 
0.508 -47.410 -29.752 
-0.513 -47.804 -26.120 
-0.700 -45.047 -23.536 
1.920 -46.737 -21.421 
5.367 -45.567 -22.392 
3.631 -42.529 -23.872 
3.548 -43.865 -27.435 
-0.098 -42.894 -27.965 
0. 517 -39. 136 -28 . 160 
2.986 -39.252 -31.086 
0.787 -41.864 -32.752 
-2.201 -39.551 -32.401 
-0.197 -36.754 -34.013 
0.466 -38.955 -37.056 
-3.116 -40.222 -37.199 
-4.574 -36.702 -37.190 
-35.623 -39.806 
-38.417 -42.162 
-38.064 -41.424 
-34.511 -42.722 
-36.070 -45.938 
-38.169 -46.353 
-41.246 -44 .508 
-43.535 -42.402 
-44.516 -39.184 
-47.286 -36.730 
-48.070 -33.471 
-51.364 -32.026 
-51.450 -28.257 
-53.141 -25.027 
-52.595 -23.047 
-51.942 -19.632 
-49.367 -20.763 
-45.581 -21.249 
-45.655 -25.029 
-47 . 344 ' -24 . 4 14 
-44.065 -23.009 
-41.484 -24.840 



-2 

-3, 
-6, 



.070 
.136 
.849 
-6.510 
-5. 182 
-8.305 
-7.016 
-9. 175 
-7.417 
-8.219 
-6.456 
-5 . 200 
-4 . 691 
-5.787 
-9. 000 
-7 .455 
-4 . 887 
-4 . 881 
-5.458 
-8 . 821 
-10 . 143 
-8 . 026 
-8 .299 
-12.111 
-11 . 998 
-10. 116 
-12. 938 
-10. 909 
-12.228 
-10.217 
-10.039 
-8 . 809 
-6. 832 



.00 45.28 BBBB 
1.00 38.63 BBBB 
1.00 30.88 BBBB 
1.00 26.66 BBBB 
1.00 23.16 BBBB 
1.00 24 . 54- BBBB 
1.00 29.43 BBBB 
1.00 33.32 BBBB 
1.00 36.08 BBBB 
1 ,00 38 . 51. BBBB 
1.00 36.57 BBBB 
1.00 33.48 BBBB 
1.00 28.22 BBBB 
1.00 27.77 BBBB 
1.00 29.00 BBBB 
1.00 26.12 BBBB 
1.00 25.07 BBB3 
1.00 25.32 BBBB 
1.00 25.94 BBBB 
1.00 25.70 BBBB 
1.00 26.15 BBBB 
1.00 29.32 BBBB 
1.00 32.38 BBBB 
1.00 32.00 BBBB 
1.00 34.91 BBBB 
1.00 37.55 BBBB 



1.00 38.24 BBBB 

1.00 35.75 BBBB 

1.00 34.58 BBBB 

,00 35.40 BBBB 

.00 34.16 BBBB 

.00 31.56 BBBB 
BBBB 



1 . 
1 
1 

1 . 00 27 . 4 1 
1.00 24.71 BBBB 
1.00 23.47 BBBB 
1.00 29.84 BBBB 
1.00 38.81 BBBB 
1.00 4 4.47 BBBB 
.00 40.44 BBBB 
.00 36.33 BBBB 
.00 31.79 BBBB 
.00 32.58 BBBB 



-42.641 -28.449 
-42.601 -28.453 



-39.054 
-38.212 



-27 .064 
-30.259 



-5 
-2 



709 
036 
-2 . 356 
0. 679 
591 
67 1 



2 . 

3. 



-39. 481 -32 .447 
-42.517 -33.514 
-46.083 -33.467 
-48.658 -31.553 
-52.442 -31.720 
-54. 410 -28.713 
-57 . 616 -28 .269 
-59. 416 -25. 133 
-59.770 -24.231 
-60.520 -20.505 
-62. 355 -19. 199 
-61.413 -22.355 
-57.928 -21.277 



1 

1 . 
1 

1 

1.00 35.60 BBBB 
1.00 33.49 BBBB 
1.00 32.68 BBBB 
1 . 00 34 .43 BBBB 
1.00 X 36 . 73 BBBB 
1.00 34.62 BBBB 
1.00 35.34 BBBB 
1.00 33.81 BBBB 
1 . 00 34.16 BBBB 
1 . 00 31 . 38 BBBB 
1.00 31.09 BBBB 
1.00 30.32 BBBB 
1 . 00 28" . 55 BBBB 
1.00 30.76 BBBB 
1.00 31.38 BBBB. 
1.00 37.51 BBBB 
1.00 37.13 BBBB 
1.00 33.17 BBBB 
1.00 30.90 BBBB 
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PCT/US0 1/11500 



9.202 -54.860 -28.639 1.00 21.22 BBBB 
10.407 -52.419 -31.324 1.00 20.50 BBBB 
13.886 -53.949 -31.144 1.00 22.79 BBBB 
12.753 -57.345 -32.424 1.00 22.06BBBB 
9.744 -56.188 -34.431 1.00 23.15 BBBB 
12.123 -54.092 -36.542 1.00 25.29 BBBB 
13.279 -57.337 -38.182 1.00 28.05 BBBB 
9.963 -57.818 -40.016 1.00 26.09 BBBB 
9.331 -54.107 -40.498 1.00 25.03 BBBB 
9.262 -52.595 -43.984 1.00 26.10 BBBB 
10.436 -49.238 -42.618 1.00 24.73 BBBB 
11.947 -48.311 -39.252 1.00 23.62 BBBB 
12.338 -44.736 -37.993 1-00 23.15 BBBB 
13.762 -43.418 -34.712 1.00 25.05 BBBB 
13.559 -40.032 -33.009 1-00 26.88 BBBB 
17 239 -39.820 -32.098 1.00 29.39 BBBB 
20.310 -41.541 -33.535 1.00 31.87 BBBB 
21.629 -44.537 -31.595 1.00 32.62 BBBB 
18.447 -46.476 -30.753 1.00 32.71 BBBB 
18.925 -48.506 -33.937 1.00 34.83 BBBB 
22:158 -48.751 -35.993 1.00 38.97^BBBB 
20.765 -47.568 -39.346 1.00 41.08 BBBB 
17.170 -46.407 -38.843 1.00 37. 55. BBBB 
16.367 -43.044 -40.460 1.00 34.40 BBBB 
16 337 -40.344 -37.764 1.00 31.16 BBBB 
13 155 -38.265 -37.889 1.00 28.10 BBBB 
12.724 -36.921 -34.355 1.00 26.93 BBBB 
9.456 -36.807 -32.375 1.00 25.27 BBBB 
6 315 -34.747 -33.004 1.00 26.14 BBBB 

6 456 -31.379 -31.216 1.00 27.75 BBBB 
3 667 -28.953 -30.246 1.00 32.36 BBBB 
3.038 -26.307 -32.924 1.00 31.74 BBBB 
3 252 -23.404 -30.466 1.00 30.64B3BB 
6*.746 -24.503 -29.440 1-00 25.91 BBBB 

7 780 -25.002 -33.075 1.00 28,46 BBBB 
6.580 -21.455 -33.756 1.00 31.43 BBBB 
9 002 -19.905 -31.268 1.00 29.60 BBBB 

11 611 -17.457 -32.642" 1.00 30.11 BBBB 
15 157 -18.780 -33.062 1.00 28;33 BBBB 
17.450 -18.550 -29.977 1.00 25.25 BBBB 
19 526 -15.527-31.049 1.00 25.46 BBBB 
16.365 -13.525 -31.718 1.00 28.47 BBBB 

14 611 -14.635 -28.525 1.0.0 29.01 BBBB 

17 673 -13.970 -26.331 1.00 29.90 BBBB 
18.766 -10.776 -28.131 1 ..00 30.78 BBBB 

19 846 -7.993 -25.784 1.00 30.10 BBBB 

18 676 -9.965 -22.787 1.00 28.97 BBBB 

20 545 -9.027 -19.621 1.00 31.79 BBBB 

19 871 -9.586 -15.943 1.00 27.75 BBBB 
19.450 -12.832 -13.913 1.00 22.93 BBBB 
19 524 -16.146 -15.729 1.00 18.01 BBBB 
15.873 -17.216 -16.011 1.00 17.62 BBBB 

15 508 -20.771 -14.741 1.00 16.. 47 BBBB 
12.361 -22.710 -15.604 1.00 16.. 75 BBBB 
11.774 -25.775 -13.381 : 1 . 00 18.41 BBBB 

9 298 -28.234 -14.948 1-00 22.11 BBBB 
7 914 -31.188 -12.994 .1.00 27.28 BBBB 

4 935 -32.163 -15.115 1.00 31.94 BBBB 

1 313 -32.665 -14.064 1.00 35.91 BBBB 

2 292 -34 .763 .-11.033 1.00 38.53 BBBB 

5 398 -32.711 -10.350- 1.00 35.02 BBBB 
8*977 -33.819 -9..709 1.00 33.12 BBBB 
9*538 -34.512 -6.010 .1.00 32.63 BBBB 

13 329 -34.168. -6.. 164- 1.00 28 ,10 BBBB 
13069 -30.833 -8.003 1.00 26.53 BBBB 
" 10 497 -29.447 -5.563 1.00 27.07 BBBB 



159 



WO 01/90301 



PCT/US01/ 11500 



ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 



4086 

4095 

4102 

4111 

4 117 

4 126 

4133 

4138 

4143 

4152 

4160 

4164 

4172 

4178 

4185 

4192 

4200 

4214 

4224 

4233 

4239 

4243 

4252 

4256 

4262 

427 1 

4280 

4286 

4293 

4302 

4311 

4316 

4328 

4 333 

4 34 2 

4347 

4351 

4361 

4 367 

4376 

4 386 

4395 

4402 

4 409 

4418 

4429 

4 437 

4 4 4 5 

4 453 

4461 

4 4 66 

4 47 1 

4 476 

4 488 

4 4 93 

4 507 

4512 

4520 

4 527 

4 534 

4 54 1 

4 54 7 

4 558 

4564 

4568 

4 573 



CA 

CA 

CA 

CA 

CA 

CA 

CA 

CA 

CA 

CA 

CA 

CA 

CA 

CA 

CA 

CA 

CA 

CA 

CA 

CA 

CA 

CA 

CA 

CA 

CA 

CA 

CA 

CA 

CA 

CA 

CA 
CA 
CA 
CA 
CA 
CA 

"CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 



HIS 
GLN 



GLN B 200 
THR B 201 
MET B 202 
PRO B 203 
GLN B 204 
VAL B 205 
ALA B 206 
ALA B 207 
LYS B 208 
LEU B 209 
GLY B 210 
ASP B 211 
SER B 212 
VAL B 213 
THR B 214 
ILE B 215 
TRP B 216 
B 217 
B 218 
SER B 219 
GLY B 220 
LY.S B 221 
C5LY B 222 
SER B 223 
GLN B 224 
GLN B 225 
SER B 226 
VAL B 227 
GLU B 228 
GLN B 229 
ALA B 230 
TYR B 231 
ALA B 232 
GLU B 233 
ALA B 234 
GLY B 235 
GLN B 236 
PRO B 237 
B 238 
B 239 
LYS B 240 
VAL B 241 
THR B 242 
GLU B 243 
PHE B 244 
ILE B 245 
ASP B 246 
ASP B 2 47 
MET B 248 
ALA B 24 9 
ALA B 2 50 
ALA B 251 
TYR B 252 
ALA B 253 
TRP B 254 
ALA B 255 
ASP B 256 
VAL B 257 
VAL B 258 
VAL B 2 59 
CYS B 260 
ARG B 261 
SER B 262 
GLY B 263 
ALA B 2 64 
LEU B 265 



GLN 
HIS 



12.955 -30.326 -2.794 1.00 30.10 BBBB 

16.215 -29.345 -4.474 1.00 27.34 BBBB 

15.567 -26.048 -6.268 1.00 23.68 BBBB 

14.608 -23.963 -3.220 1.00 23.84 BBBB 

18.033 -24.708 -1.684 1-00 26.34 BBBB 

19.672 -24.033 -5.043 1.00 24.44 BBBB 

17.980 -20.610 -5.013 1-00 22.84 BBBB 

19.442 -19.857 -1.576 1.00 26.65 BBBB 

22.915 -20.595 -2.919 1.00 28.31 BBBB 

22.577 -18.640 -6.171 1.00 25.68 BBBB 

20.675 -15.628 -4.804 1.00 26.56 BBBB 

20.370 -12.647 -7.190 1.00 28.28 BBBB 

22.098 -14.474 -10.067 1.00 25.73 BBBB 

18.925 -16.308 -11.116 1.00 20.76 BBBB 

15.204 -15.726 -11.337 1.00 19.60 BBBB 

13.076 -18.850 -11.169 1.00 18.75 BBBB 

9.661 -19.973 -12.378 1.00 19.34 BBBB 

9.015 -23.303 -10.680 1.00 21.06 BBBB 

6.149 -25.594 -11.735 1.00 24.30 BBBB 

5.463 -27.800 -8.684 1.00 26.73 BBBB 

2.855 -30.242 -9.961 1.00 30.53 BBBB 

-0.657 -30.914 -8.628 1.00 35.12 BBBB 

-1.195 -29.899 -5.011 1.00 35.34 BBBB 

2.451 -28.934 -4.418 1.00 33.98 BBBB 

2.187 -25.208 -5.186 1.00 33.71 BBBB 

I. 823 -24.239 -1.519 1.00 32.32 BBBB 
4.701 -26.309 -0.122 1.00 28.30 BBBB 
7.214 -25.247 -2.791 1.00 24.28 BBBB 
6.178 -21.592 -2.387 1.00 27.23 BBBB 
6.853 -22.046 1.329 1.00 28.38 BBBB 

10.185 -23.754 0.682 1.00 26.18 BBBB 

11.371 -20.766 -1.366 1.00 25.47 BBBB 

10.342 -18.322 1.368 1.00 27.51 BBBB 

12.145 -20.441 3.966 1.00 30.87 BBBB 

15.215 -20.417 1.714 1.00 28.48 BBBB • 

15.033 -16.627 1.815 1.00. 26.23 BBBB 

14.121 -16.198 -1.870 1.00 25.53 BBBB 

10.336 -15.587 -1.720 1.00 24.65 BBBB 

10.277 -13.558 -4.945 1.00 24.29 BBBB 

. 10.526 -16.608 -7.201 1.00 22.08 BBBB 

7.375 -17.589 -9.105 1.00 23.26 BBBB 

5.740 -20.911 -8.277 1.00 23.78 BBBB 

2.758 -22.301 -10.177 1.00 25.93 BBBB 

0.999 -25.651 -9.837 1.00 27.03 BBBB 

0.964 -26.068 -13.620 1.00 26.54 BBBB 

1.932 -24.242 -16.802 1.00 28.48 BBBB 

-0.754 -24.396 -19.457 1.00 36.00 BBBB 

1.245 -22.392 -21.999 1.00 30.74 BBBB 

4.625 -24.136 -22.138 1.00 28.41 BBBB 

5.512 -22.216 -25.290 1.00 J4.67 BBBB 

5.188 -18.933 -23.390 1.00 21.78 BBBB 

7.301 -20.259 -20.501 1.00 20.85 BBBB 

9.972 -21.616 -22.886 1.00 22.78 BBBB 

10.131 -18.224 -24.636 1.00 23.54 BBBB 
10.829 -16.534 -21.303 1.00 19.76 BBBB 
13.399 -19.025 -20.003 1.00 19.51 BBBB 
17.176 -19.026 -20.434 1.00 17.58 BBBB 
17.535 -22.603 -19.194 1.00 18.53 BBBB 
15.208 -25.456 -18.234 1.00 19.32 BBBB 
15.581 -27.957 -15.374 1.00 19.85 BBBB 
13.454 -31.055 -15.946 1.00 22.00 BBBB 
13.170 -34.800 -16.515 1.00 23.75 BBBB 
13.975 -36,189 -19.948 1.00 23.18 BBBB 

II. 026 -38.079 -21.361 1.00 22.85 BBBB 
11.482 -38.564 -25.115 1.00 24.25 BBBB 

8.846 -36.037 -26.205 1.00 24.66 BBBB 
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ATOM. 4 84 6 CA ALA B 2 99 

ATOM ■ 4851 CA GLY B" 300 

• ATOM- 4855 CA ALA B 301' 

ATOM 4 8 60 CA ALA B 30 2 

ATOM 4 8 65 CA ' LYS B 30 3 

ATOM ; 4874 CA ILE B 304 

ATOM . 4 88 2 CA ILE B" 305 

ATOM 4890 CA .GLU B '306 

ATOM . 4899 CA ' GLN B 307 

ATOM 4909 CA PRO B 308 

ATOM " 4915 CA GLN B'309 

ATOM • 4924 CA LEU B 310 

ATOM 4932 CA SER B.^l! 

ATOM 4 9 38 CA VAL B. 312 

ATOM 4 94 5 CA AS P B 313 

ATOM- 4 95 3 CA ALA B 314 

- ATOM 4 958 CA VAL B 315 

ATOM 4 965 CA ALA B 316 

ATOM . 4970 CA ASN B 317 

ATOM^ 4 97 8 CA . THR B 318 

. ATOM . 4 98 5 CA ' LEU B 3 1 9 

• ATOM 4 99.3 CA ALA B 3 20 

ATOM 4 998 CA ■ GLY B' 321 

ATOM 5002 CA- TRP . B 322 

ATOM 5016 . CA . SER B 323 

ATOM 5022 CA - ARG B 324 

■ ATOM 5033 CA GLU B 325 

ATOM ■ 5042 CA THR" B 326 

ATOM 504 9 CA~ LEU B* 327 

ATOM 5057 CA LEU B 328 

ATOM- -5065. CA THR B 329 

ATOM 5072 CA . MET B 330 

ATOM: .5080 CA* ALA B 3 31 
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10.194 -33.557 -23.657 1.00 22.34 BBBB 
13.730 -33.762 -25.023 1.00 21.11 BBBB 
12.411 -33.191 -28.567 1.00 21.96 BBBB 
10.282 -30.272 -27.378 1.00 21.95 BBBB 
13.295 -28.698 -25.638 1.00 20.62 BBBB 
15.440 -29.058 -28.776 1.00 22.45 BBBB 
12.719 -27.451 -30.898 1.00 22.17 BBBB 
12.361 -24.596 -28.407 1.00 21.97 BBBB 
16.093 -24.023 -28.709 1.00 21.07 BBBB 
16.666 -24.057 -24.966 1.00 19.78 BBBB 
19.651 -25.199 -22.875 1.00 16.62 BBBB 
18.638 -27.807 -20/321 1.00 15.80 BBBB 
19.896 -29.429 -17.145 I . 00 18.48 BBBB 
18.266 -32.838 -17.392 1.00 21.59 BBBB 
17.502 -34.902 -14.281 1.00 25.67 BBBB 
16.698 -38.320 -15.824 1.00 29.05 BBBB 
14.246 -40.496 -13.926 1.00 37.13 BBBB 
16.319 -43.395 -12.591 1.00 41.11 BBBB 
15.641 -46.917 -13.843 1.00 43.69 BBBB 
17.767 -49.993 -14.571 1.00 4 5.34 BBBB 
16.949 -49.299 -18.222 1.00 43.26 BBBB 
17.951 -45.623 -17.883 1.00 36.23 BBBB 
15.622 -44.804 -20.755 1.00 30.77 BBBB 
15.474 -41.099 -19.904 1.00 29.46 BBBB 
19.228 -40.984 -19.550 1.00 29.55 BBBB 
19. 542- -42. 282 -23.116' 1.00.28.07 BBBB 
16.902 -39 . 784 ' -24 . 270 1.00 26.06 BBBB 
18.926 -36.822 -22.979 1.00. 25.69 BBBB 
22 354 -38.088 -24.032 1.00 25.90 BBBB 
21 998 -36.870 -27.635 1.00 26.15 BBBB 
21 521 -33.265 -26.481 1.00 25.42 BBBB 
24.354-33.530 -23.953 1.00 28.78 BBBB 
26 644 -34.947 -26.648 1.00 31.90 BBBB 

25 773 -31.965 -28.847 1.00 30-38 BBBB 

26 777 -29.635 -26.017 1.00 26.18 BBBB 
23.214-28.333 -25.638 1.00 22.50 BBBB 
22.516 -29.770 -22.186 1.00 21.78 BBBB. 
23.979 -31.340 -19.048' 1..00 25.86 BBBB 
221753 -34.598 -17.550 1.00 27 . 17 BBBB 
22.843 -35.178 -13.813 1.00 29.01 BBBB 
21.664 -38.702 -13.061 1.00 34 . 65- BBBB 
20.377 -39.599'- -9.613 1.00 40.54 BBBB 
23.828 -40.891 -8.484 1.00 43.20 BBBB 
25 247 -37.361- -8.787 1.00 43.46 BBBB 
22^32 -35.166. -8.022 1.00 39.65 BBBB 
22.660 -32.714 -5.154 1.00 34.90 BBBB 

21 990 -29.074 -4.341 1.00 31.50 BBBB 
25.642 -28.202 -4.957 1.00 29.61 BBBB 
25 782 -30.099 -8.254 1.00 26,47 BBBB 

22 755 -28.215 -9.612 1.00 25-33 BBBB 

23 888 -24.872 . -8.199 1.00 27..13 BBBB 
27.444 -25.246- -9.518 1.00 28.52 BBBB 
26.174 -26.371 -12.906 -1.00 27.04 BBBB 
23 883 -23.370 -13.357 1.00 25.21 BBBB 

'26 445 -20.931 -11.^957 1.00 24.59 BBBB 
28 934 -22.031 -14.591 1.00 24.34 BB3B 
26.738 -21.007 -17.521 l.OO 21.72 BBBB 
' 27.141 -17.404 -18.692 1.00 19.04 BBBB 

•24.725 -15.741 -21.112 1.00 18,09 BBBB 
27 220 -16.368 -23.954 1.00 16.96 BBBB 
27 460 -20.055 -23.070 1.00 16.39 BBBB 

-23.659 -20.305 -22.780- 1.00 17.27 BBBB 

•23 175 -18 . 745 --26 . 222 1-00 17.39 BBBB 
25 567 -21.335-27.688 1.00 21.30 BBBB 
23 771 -24 . 153* -25.870 1.00 19.91 BBBB 
20*412 -22.871 -27.098 1.00 18.49 BBBB 
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TABLE 3 ATOMIC COORDINATES OF E.COLI MURG C-ALPHA 
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BACKBONE AND CONSERVED AMINO ACID RESIDUES 
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2 69 3 
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27 2 2 
2723 
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2726 
2727 
2728 
2729 
2731 
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2750 
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2760 
2768 
2773 
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2805 
2813 
2821 
2826 
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2839 
2853 
2862 
2869 
2880 
2894 
2902 
2906 
2913 
2918 
2926 
2937 
2945 
2954 
2959 
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N 
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CA 
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N 

CA" 
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CB 

CG 



LYS B 
ARG B 
LEU B 
MET B 
VAL B 
MET B 
ALA B 
GLY B 
GLY B 
GLY B 
GLY B 
GLY B 
GLY B 
GLY B 
GLY B 
THR B 
GLY 
. GLY 
GLY B 
GLY B 
GLY B 
HIS B 
HIS B 
HIS B 
HIS 
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B 
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HIS B 
VAL B 
PHE B 
PRO B 
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LEU B 
ALA B 
VAL B 
ALA B 
HIS B 
HIS B 
LEU B 
MET B 
ALA B 
GLN B 
GLY B 
TRP B 
GLN B 
VAL B 
ARG B 
TRP B 
LEU B 
GLY B 
THR B 
ALA B 
ASP B 
ARG B 
MET B 
GLU B 
ALA B 
ASP B 



7 
. 8 
9 
10 
11 
12 
13 
14 
14 
14 
14 
15 
15 
15 
15 
16 
17 
18 
18 
18 
18 
19 
19 
19 
19 
19 
19 
19 
19 
19 
19 
20 
21 
22 
23 
24 
25 
26 
27 
28 
29 
30 
31 
32 
33 
34 
35 
36 
37 
38 
39 
40 
41 
4 2 
43 
44 
45 
46 
47 
48 
49 



-6.512 
-6. 682 
-4 .094 
-4 . 048 
-1 . 982 
-0.523 

0. 508 

0. 150 
-0.513 
-0 . 107 

0 . 975 
-0 . 986 
-0 .700 

0.539 
1.293 

1 .920 
5.367 

. 94 9 
.631 
.825 
.34 5 
.416 
.548 
.772 
.957 
.281 
.845 
.046 
. 936 
.280 
. 300 
.098 
. 517 
. 986 
0.787 
-2. 201 
, 197 
.466 
, 116 
,574 
-2.070 
-3. 136 
-6.849 
-6. 510 
-5. 182 
-8 . 305 
-7 . 016 
. 175 
.417 
.219 
. 456 
. 200 
-4 . 691 
-5.787 
-9.000 
-7 .455 
-4 .887 
-4.881 
-5.458 
-8 .821 
-10.143 
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3 
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3. 

3 
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-0, 
0 , 
-3. 
-4 



-9. 
-7 . 
-8 . 
-6. 
-.5. 



45. 403 
47 . 303 
47.039 
49. 055 
47 . 605 
49. 707 
•47 . 410 
•47 . 934 
-47 . 804 
-46. 595 
-46. 040 
-46. 188 
-45. 047 
-45 . 254 
-44 . 311 
-46. 787 
-45 . 567 
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-20 . 259 
-21 . 616 
-18 . 224 
-16. 534 
-19 . 025 
-19 . 026 
-22 . 603 
-25 . 456 
-27 . 957 
-31 . 0 55 
-33 . 397 
-34 . 800 
-35 . 663 
-35 . 337 
-36. 473 
-36.721 
-37 . 176 
-37 .433 
-37 . 370 
-34 . 871 
-33 . 383 
-36. 189 
-37. 151 
-38 .079 
-37 .793 



-4 . 804 
-7 . 190 
■10 . 067 
-11 . 116 
-11 . 337 
-11 . 169 
-12 . 378 
-10 . 680 
-11 ,735 
-8 . 684 
-9 . 961 
-8 . 628 
-5.011 
-4 . 4 18 
-5 . 18 6 
-1 . 519 



122 
791 



1.00 26.56 BBBB 
1.00 28.28 BBBB 
.00 25.73 BBBB 
.00 20.76 BBBB 
.00 19.60 BBBB 
.00 18.75 BBBB 
1.00 19.34 BB3B 
1.00 21.06 BBBB 
.00 24.30 BBBB 
.00 26.73 B3BB 
,00 30.53 BBBB 
.00 35.12 BBBB 
.00 35.34 BBBB 
.00 33.98 BBBB 
.00 33.71 BBBB 
.00 32.32 BBBB 
.00 28.30 BBBB 
. 00 24 .'28 BBBB 



-2 . 387 
1 . 329 



1 
1 
1 

1 . 
1 . 
1 . 
1 . 
1 
1 
1 
1 



00 27.23 BBBB 



0 . 682 

1 . 366 
1 . 368 
3.966 

714 
815 
870 



1 , 
1 , 
-1 



1.00 28.38 BBBB 
1 . 00 26.. 18 BBBB 
1.00 25.47 BBBB 
1 . 00 27.51 BBBB 
1.00 30.87 BBBB 
1 . 00 28.48 BBBB 
1.00 26.23 BBBB 
1 . 00 25.53 BBBB 
-1 . 720 1 . 00 24 . 65 BBBB 
-4.945 1.00 24.29 BBBB 
-7 . 201 1 . 00 22 . 08 BBBB 
-9.105 1.00 23.26 BBBB 
-8.277 1.00 23.78 BBBB 
-10.177 1V00 25.93 BBBB 
-9 . 837 1". 00 27 . 03 BBBB 
-13.620 1.00 26.54 BBBB 
-16 . 802 1 . 00 28 . 48 BBBB ' 
r'19.457 1.00 36.00 BBBB 
-21.999 1.00 30.74 BBBB 
-22.138 1.00 28.41 BBBB 
-25.290 1.00 24.67 BBBB 
-23.390 1.00 21.78 BBBB 
-20.501 1.00 20.85 BBBB 
-22.886 1.00 22.78 BBBB 
-24.636 1.00 23.54 BBBB 
-21 .303 1.00 19.76 BBBB 
-20:003 1.00 19.51 BBBB 
-20.434 1.00 17.58 BBBB 
-19.194 1.00 18:53 BBBB 
-18.234 1.00 19.32 BBBB 
-15 . 374 1 . 00 19.85 BBBB 
-15.946 1.00 22.00 BBBB 
-16.212 1.00 22.34 BBBB 
-16. 515 1 .00 23.75 B3BB 
-16 . 104 1 . 00 27 .16 BBBB 
-14 .738 1 .00 31.82 BBBB 
-13.732 1.00 36.33 BBBB 
-13 . 323 1 .00 38 . 48 BBBB 
-12 . 125 1 .00 37 . 25 BBBB 
-11 . 204 1 . 00 38 . 46 BBBB 
-11.848 1.00 36.79 BBBB 
-18 .032 1 .00 *23.'98 BBBB 
-18 ! 746 1.00 22.44 BBBB 
-19.948 1.00 23-. 18 BBBB 
-20. 619 1 .00 22.74 BBBB 
-21.361 1.00 22.85 BBBB 
-22.813 1.00 '24:06 BBBB 
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ATOM 


4566 


O 


GLY 


B 


263 


11 . 


908 


-36. 


705 


-23. 


121 


1 . 


00 


22. 


75 


BBBB 


ATOM 


4567 


N 


ALA 


B 


264 


11. 


130 


-38 . 


739 


-23. 


708 


1 . 


00 


23. 


37 


BBBB 


ATOM 


4568 


CA 


ALA 


B 


264 


11. 


482 


-38. 


564 


-25. 


115 


1. 


00 


24 . 


25 


BBBB 


ATOM 


4569 


CB 


ALA 


B 


264 


11. 


133 


-39. 


829 


-25. 


894 


1 . 


00 


24 . 


58 


BBBB 


ATOM 


4570 


C 


ALA 


B 


264 


10. 


843 


-37 . 


343 


-25. 


783 


1 . 


00 


24 . 


29 


BBBB 


ATOM 


4571 


O 


ALA 


B 


264 


11 . 


523 


-36. 


572 


-26. 


470 


1 . 


00 


24 . 


33 


BBBB 


ATOM 


4573 


CA 


LEU 


B 


265 


8 . 


846 


-36. 


037 


-26. 


205 


1 . 


00 


24 . 


66 


BBBB 


ATOM 


458 1 


CA 


THR 


B 


266 


10. 


194 


-33. 


557 


-23. 


657 


1 . 


00 


22 . 


34 


BBBB 


ATOM 


4588 


CA 


VAL 


B 


267 


13. 


730 


-33. 


762 


-25. 


023 


1 . 


00 


21 . 


11 


BBBB 


ATOM 


4595 


CA 


SER 


B 


268 


12 . 


4 11 


-33. 


191 


-28. 


567 


1 . 


00 


21 . 


96 


BBBB 


ATOM 


4600 


N 


GLU 


B 


269 


10. 


928 


-31 . 


563 


-27. 


557 


1. 


00 


21 . 


64 


BBBB 


ATOM 


4 601 


CA 


GLU 


B 


269 


10. 


282 


-30. 


272 


-27 . 


378 


1 . 


00 


21 . 


95 


BBBB 


ATOM 


4602 


CB 


GLU 


B 


269 


9. 


213 


-30. 


399 


-26. 


292 


1 . 


00 


24 . 


72 


BBBB 


ATOM 


4603 


CG 


GLU 


B 


269 


8 . 


480 


-29. 


128 


-25. 


940 


1 . 


00 


27 . 


67 


BBBB 


ATOM 


4 604 


CD 


GLU 


B 


269 


7 . 


385 


-29. 


380 


-24 . 


908 


1 . 


00 


30 . 


05 


BBBB 


ATOM 


4605 


OE1 


GLU 


B 


269 


6. 


325 


-29 . 


915 


-25. 


287 


1 . 


00 


31 . 


50 


BBBB 


ATOM 


4606 


OE2 


GLU 


B 


269 


7 . 


591 


-29. 


057 


-23. 


719 


1 . 


00 


29. 


84 


BBBB 


ATOM 


4607 


C 


GLU 


B 


269 


11 . 


321 


-29. 


214 


-26. 


999 


1 . 


00 


21 . 


68 


BB3B 


ATOM 


4608 


O 


GLU 


B 


269 


11 . 


301 


-28 . 


095 


-27. 


518 


1 . 


00 


18 . 


12 


BBBB 


ATOM 


4610 


CA 


ILE 


B 


270 


13. 


295 


-28. 


698 


-25. 


638 


1 . 


00 


20. 


62 


BBBB 


ATOM 


4618 


CA 


ALA 


B 


271 


15. 


440 


-29. 


058 


-28. 


776 


1 . 


00 


22 . 


45 


BBBB 


ATOM 


4623 


CA 


ALA 


B 


272 


12 . 


719 


-27 . 


451 


-30. 


898 


1 . 


00 


22. 


17 


BBBB 


ATOM 


4628 


CA' 


AtA 


B 


273 


12. 


361 


-24 . 


596 


-28. 


407 


1 . 


00 


21 . 


97 


BBBB 


ATOM 


4633 


CA 


GLY 


B 


274 


16. 


093 


-24 . 


023 


-28. 


709 


1 . 


00 


21 . 


07 


BBBB 


ATOM 


4637 


CA 


LEU 


B 


275 


16. 


666 


-24 . 


057 


-24. 


966 


1 . 


00 


19. 


78 


BBBB 


ATOM 


464 6 


CA 


PRO 


B 


276 


19. 


651 


-25. 


199 


-22. 


875 


1 . 


00 


16. 


62 


BBBB 


ATOM 


4652 


CA 


ALA 


B 


277 


18 . 


638 


-27 . 


807 


-20. 


321 


1 . 


00 


15 . 


80 


BBBB 


ATOM 


4657 


CA 


LEU 


B 


278 


19. 


896 


-29. 


429 


-17. 


. 145 


1 . 


, 00 


18 . 


.48 


BBBB 


ATOM 


4665 


CA 


PHE 


B 


279 


18 . 


.266 


-32 . 


.838 


-17 , 


, 392 


1 . 


. 00 


21 . 


.59 


BBBB 


ATOM 


4676 


CA 


VAL 


B 


280 


17 . 


, 502 


-34 . 


. 902 


-14 . 


, 281 


1 . 


. 00 


25 . 


. 67 


BBBB 


ATOM 


4 682 


N 


PRO 


B 


281 


17 , 


. 324 


-37 , 


. 080 


-15. 


. 370 


1 . 


. 00 


27 . 


.08 


BBBB 


ATOM 


4683 


CD 


PRO 


B 


281 


18 . 


.750 


-37 . 


.057 


-15. 


,726 


1 . 


. 00 


27 . 


. 31 


BBBB 


ATOM 


4684 


CA 


PRO 


B 


281 


16. 


. 698 


-38, 


. 320 


-15 


.824 


1 


.00 


29 


.05 


BBBB 


ATOM 


4 685 


CB 


PRO 


B 


281 


17 , 


.851 


-39 


.071 


-16 


.492 


1 


.00 


29 


.44 


BBBB 


ATOM 


4 686 


CG 


PRO 


B 


281 


18 . 


.791 


-37 , 


. 992 


-16 


.895 


1 


. 00 


29 


.67 


BBBB 


ATOM 


4 68 7 


C 


PRO 


B 


281 


16 


.092 


-39 


. 121 


-14 


. 684 


1 


. 00 


31 


.51 


BBBB 


ATOM 


4688 


O 


PRO 


B 


281 


16 


. 675 


-39 


. 223 


-13 


. 603 


1 


.00 


32 


.26 


BBBB 


ATOM 


4690 


CA 


PHE 


B 


282 


14 


.246 


-40 


.496 


-13 


. 926 


1 


. 00 


37 


. 13 


BBBB 


ATOM 


4701 


CA 


GLN 


B 


283 


16 


.319 


-43 


. 395 


-12 


.591 


1 


. 00 


41 


. 11 


BBBB 


ATOM 


4710 


CA 


HIS 


B 


284 


15 


. 641 


-46 


. 917 


-13 


.843 


1 


. 00 


43 


.69 


BBBB 


ATOM 


4720 


CA 


LYS 


B 


285 


17 


.767 


-49 


. 993 


-14 


.571 


1 


. 00 


45 


. 34 


BBBB 


ATOM 


4729 


CA 


ASP 


B 


286 


16 


. 949 


-49 


.299 


-18 


.222 


1 


. 00 


4 3 


.26 


BBBB 


ATOM 


4 737 


CA 


ARG 


B 


287 


17 


. 951 


-45 


. 623 


-17 


.883 


1 


.00 


36 


.28 


BBBB 


ATOM 


4 748 


CA 


GLN 


B 


288 


15 


. 622 


-44 


. 804 


-20 


.755 


1 


.00 


30 


. 77 


BBBB 


ATOM 
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N 


GLN 


B 
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15 


.378 


-42 
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-19 


.857 


1 


.00 


29 


.38 


BBBB 


ATOM 
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B 
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15 


.474 


-41 


. 099 


-19 


.904 


1 


.00 


29 


.46 


BBBB 
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CB 
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B 
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14 


.772 


-40 


.472 


-18 


.700 


1 


.00 


29 


.25 


BBBB 
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CG 


GLN 


B 


289 


13 


.265 


-40 
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-18 


.883 


1 


.00 


29 


.32 


BBBB 


ATOM 


4760 


CD 


GLN 


B 


289 


12 


.575 


-39 


. 585 


-17 


.826 


1 


.00 


29 


.84 


BBBB 


ATOM 


4761 


OE1 


GLN 


B 


289 


13 


. 191 


-38 


.728 


-17 


. 188 


1 


.00 


29 


.52 


BBBB 


ATOM 


4762 


NE2 


GLN 


B 


289 


11 


.281 


-39 


. 821 


-17 


. 647 


1 


.00 


ia 


. 95 


BBBB 


ATOM 


4763 


C 


GLN 


B 


289 


16 


. 906 


-40 


. 613 


-20 


.005 


1 


.00 


29 


. 36 


BBBB 


ATOM 


4764 


O 


GLN 


B 


289 


17 


.173 


-39 


. 557 


-20 


. 585 


1 


.00 


29 


. 12 


BBBB 


ATOM 


4766 


CA 


TYR 


B 


290 


19 


.228 


-40 


. 984 


-19 


.550 


1 


. 00 


29 


. 55 


BBBB 


ATOM 


4778 


CA 


TRP 


B 


291 


19 


.542 


-42 


. 282 


-23 


.116 


1 


. 00 


28 


. 07 


BBBB 


ATOM 


4791 


N 


ASN 


B 


292 


17 


. 653 


-40 


. 779 


-23 


. 508 


1 


.00 


25 


. 52 


BBBB 


ATOM 


4792 


CA 


ASN 


B 


292 


16 


. 902 


-39 


. 784 


-24 


.270 


1 


.00 


26 


.06 


B3BB 


ATOM 


4793 


CB 


ASN 


B 


292 


15 


.484 


-39 


. 599 


-23 


.709 


1 


. 00 


24 


. 78 


BBBB 


ATOM 


4794 


CG 


ASN 


B 


292 


14 


. 590 


-40 


. 811 


-23 


. 928 


1 


. 00 


24 


. 46 


BBBB 


ATOM 


4795 


OD1 


ASN 


B 


292 


14 


.842 


-41 


. 641 


-24 


.798 


I 


.00 


25 


. 33 


BBBB 


ATOM 


4796 


ND2 


ASN 


B 


292 


13 


. 523 


-40 


. 900 


-23 


. 146 


1 


.00 


23 


.83 


BBBB 


ATOM 


4797 


C 


ASN 


B 


292 


17 


. 605 


-38 


.427 


-24 


.258 


1 


.00 


25 


.99 


BBBB 


ATOM 


4798 


O 


ASN 


B 


292 


17 


. 566 


-37 


. 687 


-25 


.244 


1 


.00 


26 


. 18 


BBBB 


ATOM 


4799 


N 


ALA 


B 


293 


18 


. 242 


-38 


. 105 


-23 


.139 


1 


.00 


25 


.66 


BBBB 


ATOM 


4300 


CA 


ALA 


B 


293 


18 


. 926 


-36 


.822 


-22 


. 979 


1 


.00 


25 


. 69 


BBBB 


ATOM 


4301 


CB 


ALA 


B 


293 


18 


. 940 


-36 


.422 


-21 


. 506 


1 


.00 


24 


. 17 


BBBB 
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ATOM 


4802 


C 


ALA 


B 


293 


ATOM 


4803 


O 


ALA 


B 


293 


ATOM 


4805 


GA 


LEU 


B 


294 


ATOM 


4814 


CA 


PRO 


B 


295 


ATOM 


4820 


CA 


LEU 


B 


296 


ATOM 


4828 


CA 


GLU 


B 


297 


ATOM 


4837 


CA 


LYS 


B 


298 


ATOM 


4846 


CA 


ALA 


B 


299 


ATOM 


4851 


CA 


GLY 


B 


300 


ATOM 


4855 


CA 


ALA 


B 


301 


ATOM 


4860 


CA 


ALA 


B 


302 


ATOM 


4865 


CA 


LYS 


B 


303 


ATOM 


4874 


CA 


ILE 


B 


304 


ATOM 


4882 


CA 


ILE 


B 


305 


ATOM 


4890 


CA 


GLU 


B 


306 


ATOM 


4899 


CA 


GLN 


B 


307 


ATOM 


4909 


CA 


PRO 


B 


308 


ATOM 


4915 


CA 


GLN 


B 


309 


ATOM 


4 924 


CA 


LEU 


B 


310 


ATOM 


4932 


CA 


SER 


•B 


311 


ATOM 


4938 


CA 


VAL 


B 


312 



ATOM 4 94 5 CA ASP B 313 

ATOM 4 95 3 CA .AlA B 314 

ATOM 4 958 CA VAL B 315 

ATOM 4 965 CA ALA B 316 

ATOM 4970 CA ASN B 317 

ATOM . 4978 CA THR B 318 

ATOM 4 98 5 CA LEU B 319 

ATOM 4993 CA ALA B 320 

ATOM 4998 CA GLY B 321 

ATOM 5002 CA TRP B 322 

ATOM .5016 CA SER B 32 3 

ATOM 5022 CA ARG B 324 

ATOM 5033 CA GLU- B 325 

' ATOM 5042 ' CA THR B 326 

'ATOM 5049 CA LEU B 327 

ATOM 5057 CA LEU B'328 

'ATOM 5065 CA THR B 329 

ATOM 5072 CA MET B -330 

ATOM - 5080 CA ALA B 331 

ATOM • 5085 CA ' GLU B -332 

ATOM 5094 CA ARG B 333 

ATOM 5105 CA ALA B 334 

ATOM 5110 CA ARG B-335 

. ATOM 5121- CA ALA B 336 

ATOM 5126 • CA ALA B 337 

7VTOM ' 5131 CA SER B 338 

• ATOM " 5137 CA I LE ■ B 339 

ATOM 514 6 CA PRO B 34 0 

ATOM * 5152 OA ASP B 34 1 

.ATOM 5160 CA ALA B 342 

ATOM : 5165 CA THR B 34 3 

ATOM ■ 5172 CA GLU B 344 

ATOM 5181 ■ 4 CA ARG B 34 5 

' ATOM 5192 • CA VAL B 34 6 

ATOM 5199 CA ALA B 34 7 

ATOM 5204 CA ASN B 34 8 

ATOM 5212 CA GLU B 349 

ATOM 5221 ' CA VAL B 350 

' ATOM 522.8 . CA SER B 351 

- .ATOM 5234 CA ARG B 3-52 

\ ATOM 524 5 CA • VAL B 353 

• ATOM 5252 . CA ALA B 354. 

ATOM 5257 .CA ARG B 355 

"ATOM 5268 CA ALA B 356 

ATOM 5273 ' CA LEU B 357 




PCT/USO 1/11500 



20 346 -36.800 -23.521 1-00 25.67 BBBB 

20 855 -35.743' -23.902 1-00 25.52 BBBB 

22 354 -38.088 -24.032 1.00 25.90 BBBB 

21 998 -36.870 -27.635 1-00 26.15 BBBB 
21*521 -33.265 -26.481 1.00 25.42 BBBB 

24 354 -33.530 -23.953 1.00 28.78 BBBB 
26644 -34.947 -26.648 1.00 31.90 BBBB 

25 773 -31.965 -28.847 1.00 30.38 BBBB 
26777 -29.635 -26.017 1.00 26.18 BBBB 

23 214 -28.333-25.638 1.00 22.50 BBBB 

22 516 -29.770 -22.186 1.00 21.78 BBBB 

23 979 -31.340 -19.048 1.00 25.86 BBBB 

22 753 -34.598 -17.550 1.00 27.17 BBBB 
22*843 -35.178 -13.813 1.00 29.01 BBBB 

21 664 -38.702 -13.-061 1.00 34.65 BBBB 
20 377 -39.599 -9.613 1.00 40.54 BBBB 

23 828 -40.891 -8.-484 1.00 43.20 BBBB 
25 247 -37.361 -8.787 1.00 43.46 BBBB 

22 232 -35.1-66 -8.022 1.00 39.65 3BBB 
22*660 -32.714 -5.154 1.00 34.90 BBBB 
21*990 -29.074 -4.341 1.00 31.50 BBBB 
25*642 -28.202 -4.957 1.00 29.61 BBBB 
25 782 -30.099 -8.254 1.00 26.47 BBBB 
22*755 -28.215 -9.612 1.00 25.33 BBBB 

23 888 -24.872 -8.199 1.00 27.13 BBBB 
27*444 -25.246 -9.518 1.00 28.52 BBBB 
26*174 -26.371 -12.906 l.OO 27.04 BBBB 
23*883 -23.370 -13.357 1.00 25.21 BBBB 
26*445 -20.931 -11.957 1.00 24.59 BBBB 
28*934 -22.031 -14.591 1.00 24.34 BBBB 
26*738 -21.007 -17.521 1.00 21.72 BBBB 
27 141 -17.404 -18.692 1.00 19.04 BBBB 
24*725 -15.741 -21.112 1.00 18.09 BBBB 
27 220 -16.368 -23.954 1.00 16.96 BBBB 
27 460 -20.055 -23.070 1.00 16.39 BBBB 
23*659 -20.305 -22.780 1.00 17.27 BBBB 
23*175-18.745 -26.222 1.00 17.39 BBBB 
25 567 -21.335-27.688 1.00 21.30 BBBB 
23 771 -24.153-25.870 1.00 19.91 BBBB 

20 412 -22.871 -27.098 1.00 18.49 BBBB 

21 626 -22.827 -30.704 .1.00 '.21.47 BBBB 
23 040 -26.330 -30.408 1.00 23.77 BBBB 

19 648 -27.420 -29.063 1.00 22.88 BBBB 
17*795 -25.892-32.002 1.00 23.54 BBBB 

20 330 -27.477 -34.372 1.00 26.85 BBBB 
19*740 -30.925 -32.865 .1.00 30.89 BBBB 
16 008 -30.432 -33.408 1.00 32.41 BBBB 
13 882 -31.941 -36.187 1.00 34.35 BBBB 
10 733 -29.730 -36.600 "1.00 34.94 BBBB 

8 711 -31.820 -39.056 1.00 33.33 BBBB 
8*875 -35.238 -37.411 1.00*29.09 BBBB 
5 115 -35.696 -37.744 1.00 23.55 BBBB 
5 085 -34.933 -41.480 1.00 32.00 BBBB 
8 138 -37.123 -42.067 1.00 31.44 BBBB 
6*578 -40 151 -40.384 1.00-28.61 BBBB 
3*249 -39.617 -42.137" 1.00-28.96 BBBB 
5035 -39.286 -45.493 1.00 34.56 BBBB 
6*954 -42- 540" -44 .956 1.00 34.86 BBBB 
3*767 -44.306 -43.919 1.00 33.79 BBBB 
2*196 -42 946 r47.09S 1.00 36.67 BBBB 
5*114 " -44.088 -49.251 1.00 40.03 BBBB 
5*089-47.587 -47.737 1.00 42.73 BBBB 
1*336 -47.957 -48.212' 1.00 47.24 BBBB 
2*035-46-964 -51.824 1.00 52.71 BBBB 
4*453 -49' 913 -51.809 1.00 54.93 BBBB 
7*023' -47.522 -53.289 1.00 57.81 BBBB 
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TABLE 4 ATOMIC COORDINATES OF THE 



DONOR NUCLEOTIDE BINDING SITE 



REMARK 4 1MUR COMPLIES WITH 

ATOM 1 N LEU B 187 

ATOM 2 CA LEU B 187 

ATOM 3 C LEU B 187 

ATOM 4 O LEU B 187 

ATOM 5 CB LEU B 187 
ATOM 6 CG LEU B 18 7 

ATOM 7 CD1 LEU B 187 

ATOM 8 CD2 LEU B 187 

ATOM 9 N VAL B 188 

ATOM 10 CA VAL B 188 

ATOM 11 C VAL B 188 

ATOM 12 O VAL B 188 

ATOM 13 CB VAL B 188 

ATOM 14 CGI VAL B 188 

ATOM 15 CG2. V&L B 188 

ATOM 16 N ' VAL B 189 

ATOM 17 CA VAL B 189 

ATOM 18 C VAL B 189 

ATOM 19 O VAL B 18 9 

ATOM 20 CB VAL B 18 9 

ATOM 21 CGI VAL B 18 9 

ATOM 22 CG2 VAL B 18 9 

ATOM 23 N GLY B 190 

ATOM 24 CA GLY B 190- 

ATOM 25 C GLY B 190 

ATOM 26, O GLY B 190 

ATOM 27 N GLY B 191 

ATOM 28 CA GLY B 191 

ATOM 29 C GLY B 191 

ATOM 30 O GLY B 191 

ATOM -31 N ALA B 195 

ATOM 32 CA ALA B 195 

ATOM 33 C ALA B 195 
ATOM 3 4 O ALA B 195 

ATOM 3 5 CB ALA B 195 

ATOM 36 N LEU B 198 

ATOM" 37 CA LEU B 198 

ATOM 38 C LEU B 198 

ATOM 39 O LEU B 198 

ATOM 4 0 CB LEU B 198 

ATOM 41 CG LEU B 198 

ATOM 42 CD1 LEU B 198 

ATOM 43 CD2 LEU B 198 

ATOM 4 4 N TYR B 2 52 

ATOM 4 5 CA TYR B 2 52 

ATOM 4 6 C TYR B 2 52 

ATOM 47 O • TYR B 252 

ATOM ' 48 CB • TYR B 252 
ATOM 4 9 CG TYR B 2 52 

ATOM 50 CD1 TYR B -252 

ATOM 51 CD2 TYR. B 252 - 

ATOM 52 CE1 TYR B 252 

ATOM 53 CE2 TYR B 252 

ATOM 54 CZ TYR B 252 

ATOM 55 OH TYR B 252 

ATOM 56 N VAL B 2 58 

ATOM 57 CA VAL B 258 

• ATOM 58 C VAL B 258 



FORMAT V. 
13.695 
12 . 361 
12 . 450 
13. 115 
11.813 
10 . 445 
9 . 368 
10 . 198 
11.783 
11.774 
10 . 434 

9.371 
11 . 902 
12.088 
13.061 
10 . 493 

9 .298 

9. 191 
10 .067 

9.299 

8 .009 

9 .470 
8 . Ill 
7.914 



808 
668 
025 
935 
676 
691 
761 

8 . 977 
9.423 

9 . 955 
10 .073 
12 .897 
13.069 
12 . 388 
12 . 930 
12 .504 
13 . 196 
12 . 625 
14 . 692 

8 .723 
9.972 
10 .566 
11.784 
9.726 
9. 662 
9.003 
10.288 
8 . 961 
10.253 
9. 590 
9. 554 
16. 263 
15.208 
15 .799 



2.0, 11 
22. 128 
22 .710 
■24 . 146 
-24 . 982 
•22 . 701 
-23 . 340 
-22 . 478 
-23 . 449 
-24 .426 
-25.775 
-26.440 
-25.967 
-25.714 
-27 . 126 
-24 .818 
-27 . 532 
-28 .234 
-29..639 
-30.478 
-28 . 342 
-29.013 
-26. 943 
-29.887 
-31 . 188 
-32 . 026 
-33 . 208 
-31 . 430 
-32. 163 
-32 . 104 
-31 . 556 
-33.045 
-33.819 
-33.590 
-32.533 
-33. 387 
-32.223 
-30.833 
-29 . 893 
-28 .835 
-30 . 616 
-31 . 408 
-31 .007. 
-31 . 146 
-21 . 314 
-21 . 616 
-20 . 354 
-20. 180 
-22 . 661 
-24 . 100 
-25.065 
-24 . 505 
-26. 392 
-25.838 
-26.772 
-28 . 088 
-24 . 643 
-25.4.56 
-26. 585 



-MAY-2000 
-15. 588 
-15. 604 
-15 .085 
-15. 688 
-17 .035 
-17 . 276 
-16. 625 
-18.. 783 
-13 . 964 
-13.381 
-13.739 
-13.336 
-11.842 
-11.270 
-11.449 
-14 . 496 
-14 . 948 
-14 . 351 
-14 . 559 
-16. 488 
-16. 981 
-17 . 101 
-13 . 615 
-12 .994 
-13 . 604 
-13 . 283 
-14 .497 
-15 . 115 
-14 .269 
-13 . 165 
-9.938 
-9'. 709 
-8 .267 
-7 . 923 
-10 . 679 
-7 . 590 
-8 .003 
-7 . 006 
-6. 667 
-9.412 
-10. 524 
-11 .874 
-10 . 493 
-22. 184 
-22 . 886 
-23 . 516 
-23.550 
-23.980 
-23. 505 
-24 .261 
-22 . 319 
-23.861 
-21 .912 
-22 .687 
-22 . 305 
-18.818 
-18 .2 34 
-17 . 389 



1.00 15.92 N 
1.00 16.75 C 
1 .00 16.85 C 
1.00 17.18 O 
1 . 00 16. 85 C 
1 . 00 18 . 63 C 
1 . 00 19.42 C 
1 . 00 19. 11 C 
1.00 18.20 N 
1.0018.41 C 
1.00 19.88 C 
1.00 20.39 O 
1.00 18.98 C 
1 . 00 18 . 50 C 
1 . 00 18 .83 C 
1 . 00 21 . 55 N 
1.00 22.11 C 
1 . 00 23 . 90 C 
1.00 23.61 O 
1 . 00 22 . 50 C 
1.00 22.70 C 
1.00 21.26 C 
1.00 25.60 N 
1.00 27.28 C 
1.00 29.67 C 
1 . 00 29.86 O 
1.00 30.56 N 
1.00 31.94 C • 

1.00 33.11 C 
1.00 32.14 O 
1.00 33.54 N 
1.00 33.12 C 
1 . 00 32 . 37 C 
1 . 00 31 . 47 O 
1 . 00 33 . 17 C 
1 . 00 27 . 07 N 
1 . 00 26 . 58 C 
1.0026.41 C 
1 . 00 26. 35 O 
1 . 00 25 . 88 C 
1.00 25.40 C 
1.00 26.54 C 
1.00 25.94 C 
1.00 21.27 N 
1 . 00 22 .73 C 
1.00 23.57 C 
1 .00 23. 91 O 
1 . 00 21 . 62 C 
1 . 00 23 . 34 C 
1 . 00 22 .88 C 
1 . 00 22 . 30 C 
1.00 24.81 C 
1.00 23.56 C 
1 . 00 24 . 26 C 
1.00 25.57 O 
1 .00 18.74 N 
1:00 19.32 C 
1.00 19.70 C 
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SER B 268 
SER B .268 
SER B 268 
SER B 268 
SER B 268 
GLU B 269 
GLU B 269 
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B 269 
B 269 
GLU B 269 
GLU B 2.69 
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GLU 
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C 
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CG 
CD1 
. CD2 
CEl 
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N 

CA 
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CB 



B 270 
ILE B 270 
ILE B 270 
ILE B 270 
ILE B 270 
ILE B 270 
ILE B 270 
ILE B 270 
ALA B 27 7 
ALA B 277 
ALA B 27 7. 
ALA B 277 
ALA B 27 7 
LEU B 278 
LEU B 278 
LEU B 278 
LEU B 278 
LEU B 278 
LEU B 278 
LEU B 278 



LEU B 278 
PHE B 279 
PHE B 279 
PHE B 279 
PHE B 279 
PHE B 279 
PHE .B 279 
. PHE B .279 
PHE B 279. 
PHE B 279 
PHE B 279 
PHE B 279 
VAL; B 280 
VAL B 280 
VAL B 280 
VAL B 280 
VAL B.280 



CGI VAL B 280 
CG2 VAL B 280 
PRO B 281 
PRO t B 281 
PRO B 281 
PRO.B 281 
PRO B 281 
PRO B 281 
PRO 3 281 
PHE. B 282 
PHE 



. N 
. CA 
C 

o 

CB 
CG 
CD 
N 

CA 
C 

o 

. CB 
CG 



B 282 
PHE B 282 
PHE B 282 
PHE~ B 282 
PHE B 282 



12.411 - 
11.817 - 
12 . 158 - 
11.474 - 
12.141 - 
10.928 - 
10.282 - 
11.321 - 
11.301 - 
9.213 - 
8 . 480 - 
7 . 385 • 
6. 325 
7 . 591 
12 . 224 
13.295 
14 .214 
14 . 595 
14 . 157 
13. 337 
15.415 
12.926 
19.316 
18 . 638 
19.591 

20 .710 
17 . 641 
19. 147 
19. 896 
18.884 
17 . 870 
20. 140 
21 . 084 

21 . 283 
20 . 497 
19 . 149 
18 .266 
18 . 525 
19. 671 
13.385 

17 .740 

18 .481 
16 . 379 
17 . 874 . 
15.759 
16.515 
17.445 
17 . 502 
16. 690 
15. 509 
16 . 883 
16. 954 
17 . 631 
17 . 324 
16. 698 
16.092 
16. 675 
17 .851 
18.791 
18.750 
14 . 908 
14 .246 
15.078 
15.357 
12 .818 
12 .032 



33.191 - 
31.790 - 
30.933 - 
34.121 • 
35.316 - 
31.563 • 
30.272 
29. 214 
-28 .095 
•30.399 
-29. 128 
-29. 380 
-29.915 
-29.057 
-29. 581 
-28 . 698 
-28.314 
-27. 151 
-29.391 
-29.574 
-28.595 
-28 . 291 
-27 . 110 
-27 . 807 
-28.526 
-28 .891 
-28 . 805 
-28.673 
-29.429 
-30.535 
-30.330 
-28.619 



-29. 308 
-28 .411 
-30 . 647 
-31 . 691 
-32.838 
-33.709 
-34 . 065 
-33.700 
-33.099 
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.1 



ATOM 
ATOM 
ATOM 
ATOM 



373 CB ILE B 141 

374 CGI ILE B 141 

375 CG2 ILE B 141 

376 CD1 ILE B 141 



8.854 -58.721 -39.405 1.00 26.39 

8.298 -58.092 -38.127 1.00 26.45 

9.401 -60.118 -39.145 1.00 28.33 

7.136 -58.845 -37.549 1.00 26.95 



C 
C 

c 



TER 



The following examples are presented for purposes of illustration only and are 
not intended to limit the scope of the invention in any way. 



This example* describes the crystallization of the E. coli MurG protein and the 
determination of the coordinates of the three-dimensional crystal structure. This example 
also describes the identification of the donor nucleotide binding site, the acceptor binding 
site and the membrane association site of the MurG protein. 



The 1.9 A X-ray structure of a membrane-associated glycosyltransferase involved in* 
peptidoglycan biosynthesis is reported. This enzyme, MurG, contains two ?/? open sheet 
domains separated by a deep cleft. The C-terminal domain contains the UDP-GlcNAc 
binding site while the N-terminal domain contains the acceptor binding site and likely 
membrane association site. Combined with sequence data from other MurG homologs, 
this structure provides insight into the residues that are important in substrate binding 
and catalysis. We have also noted that a conserved region found in many UDP-sugar 
transferases maps to a ?/?/?/? supersecondary structural motif in the donor binding region 
of MurG, an observation that is be helpful in glycosyltransferase structure prediction. 



Crystallization 

£. coli MurG containing a C-terminal LEHHHHHH sequence was purified as 
described(Ha et aL, 1999) and concentrated to 10 mg ml" 1 in 20 mM Tris-HCl, pH 7.9/ 
150 mM NaCl/ 50 mM EDTA. The protein concentrate was mixed with UDP-GlcNAc 



EXAMPLE 1 



Abstract 



Methods 
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in a 1:3 molar ratio. Crystals were grown at room temperature using the hanging-drop 
vapor-diffusion method by mixing equal volumes of protein with reservoir solution (0.1 
M NaMES, pH 6.5/ 0.96 M (NH 4 ) 2 SO a / 0.4% Triton X-100/ 10 mM DTT). Triclinic 
crystals with a typical size of 0.2 mm X 0.1 mm X 0.1 mm grew within a week. The 
crystals belong to the PI space group, with two molecules per asymmetric unit. The cell 
dimensions are a=60.613 A, b=66.356 A, c=67.902 A, ?=64.294, ?=83.520, ???????????? 

Data collection and processing 

All' data sets were collected at 100 K on previously flash frozen crystals. Crystals were 
equilibrated in a cryoprotectant buffer with 0.1 M NaMES, pH 6.5, 1.44 M (NH 4 ) 2 S0 4 , 
0.4% Triton X-100, and 20% glycerol. Heavy-atom soaks were carried out in the same 
buffer containing one of the following heavy-atom solutions: 2 mM HgCl 2 , 1 mM 
(NH 4 ) 2 WS 4 , 1 mM (NH 4 ) 2 OsBr 6 . Crystals were flash- frozen in liquid nitrogen. HgCl 2 
(form A derivative) and (NH 4 ) 2 OsBr 6 derivative data were collected at an R-AXISIIC 
imaging plate detector mounted on a Rigaku 200HB generator. Native, HgCl 2 (form B 
derivative), and (NH 4 ) 2 WS 4 derivative diffraction data were collected at beam-line 
. BioCARS-14B at the Advanced Photon Source, at wavelengths 1.0092 A, 0.9900 A and 
1.2147 A respectively. Collection of data on the HgCl 2 derivative was initially designed 
for MAD phasing; however, the mercury derivative proved to be unstable to X-rays, and 
after a two-hour exposure to synchrotron radiation the form A derivative metamorphosed 
into a different mercury derivative (form B) that was suitable for MIR phasing. All the 
data were reduced using DENZO and SCALEPACK (Otwinowski & Minor, 1997), and* 
processed with CCP4 programs (CCP4, 1994). 

Structure determination and refinement 

The structure was solved by multiple isornorphous replacement combined with 
anomalous scattering of mercuric derivatives (Table I). Initial MIR phases calculated 
with program MLPHARE had a mean figure of merit of 0.44 to 2.5 A, and were 
improved by solvent flattening and histogram matching using DM. An MIR map was 
generated which had continuous electron density for most regions of the protein. A 
model was built with the program O (Jones et ai y 1991), and the structure was refined 
against 1.9 A data using energy minimization, simulated annealing and B-factor 
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refinement with the program CNS (Brunger et at., 1998). The N-terminal six residues 
and the C-terminal His-tag had no electron density and were not included in this model. 
There was no electron density for UDP-GlcNAc. 

Results and discussion 

Overall fold 

The crystal structure of £. colt MurG was solved by a combination of multiple 
isomorphous replacement and anomalous scattering, and refined to 1.9 A resolution 
(Table 1). 



Table I. Summary of crystallographic and refinement data 



Data set 


Native 


HgCl2 (form A derivative) 


HgCl2 (form B derivative) 


(NH 4 ) 2 WS4 


(NH 4 ) 2 0sBi 


Resolution (A) 


1.9 


2.0 


1.9 


2.4 


2.3 


Observations 


288,150 


101,913 


245,320 


44,366 


106,606 


Unique reflections 


65,567 


53,391 


65,581 


27.950 


36,443 


Rsym^last shell) 


0.032 (0.t87) 


0.043 (0.200) 


0.042 (0.296) 


0.031 (0.080) 


0.056 (0.30: 


\H (last shell) 


41.9(7.0) 


20.4(2.9) 


29.0 (3.7) 


24.6 (8.2) 


19.6(2.5) 


Completeness (last shell) 


97.7% (96.4%) 


9 1.4% (66.6%) 


97.4% (94.0%) 


83.8% (62.0%) 


94.5%(78.6< 


MIR analysis ( 40.0 - 2.5 A) 












Mean isomorphous difference^ 


0.163 


0.130 


0.068 


0.134 


Phasing power 3 (last shell) 




1.09 (0.73) 


0.57(0.50) 


0.61 (0.24) 


061 (0.58! 


Rcullis 4 0ast shell) 




0.81 (0.91) 


0.94 (0.96) 


0.92 (0.99) 


0 94(0.95; 


Anomalous Rcullis ('ast shell) 


0.96(1.00) 


0.95 (1.00) 






Refinement statistics 












Resolution 


40.0- 1.9 A 




R. m. s. d. 7 






Reflections ( |F| > 2? ) 


61,989 




Bonds (A) 




0.006 


Protein atoms (a. u.) 


5,280 




Angles ( 0 ) 




L29 


Water Atoms 


298 










Sulfate groups 


1 




Ramachandran plot 8 






R-factor 5 


22.0% 




Residues in most favored 


region* 


94.6% 


R-free 6 


24.7% 




Residues in additional allowed region 


5.4% 



' Rsym ~ ? Mi ' I / ? l i» where \{ is the intensity of a reflection, and <1> is the average intensity of that reflection. 

^Mean isomorphous difference - ? JFpH - Fp| / ?Fph, where Fpn and Fp are the derivative and native structure factors respectively. 

^Phasing power is the ratio of the mean calculated derivative structure factor to the mean lack of closure error. 

4 R cu ||i 3 is the mean residual lack of closure error divided by the dispersive or anomalous difference. 

5R-factor = '>i|F obs |-jF ca i c i|/?|F obs | 

GR-free is the R-factor calculated using 10% of the reflection data chosen randomly and omitted from the start of refinement. 
~R m. s. d., root-mean-square deviations from ideal bond lengths and bond angles. 
^Calculated with program PROCHECK. 
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The structure consists of two domains separated by a deep cleft (Fig. 2a). Both domains 
exhibit an ?/? open-sheet structure and have high structural homology despite minimal 
sequence homology (RMSD = 2.02 over 85 aligned C? atoms). The N-domain includes 
residues 7-163 and 341-357, and contains seven parallel ?-strands and six ?-helices, the 
last of which originates in the C-domain (Fig. 2b). The C-domain comprises residues 
164-340 and contains six parallel ?-strands and eight ?-helices, including one irregular 
bipartite helix (?-link) that connects the N-domain to the First ?-strand of the C-domain. 
The ?-strands in both domains are ordered as for a typical Rossman fold. The N- and C- 
domains are joined by a short linker between the seventh ?-strand of the N-domain and 
the ?-link of the C-dbmain. This inter-domain linker and the peptide segment that joins 
the last helix of the C-domain to the last helix of the N-domain define the floor of the 
cleft between the two domains. The cleft itself is about 20 A deep and 18 A across at its 
widest point. Contacts < 4 A across the cleft are limited primarily to interactions 
between residues from C-?5 to the loop connecting N-?5 to N-?5. 

The ?/? open-sheet motif (Rossman fold) adopted by both the N- and C-domains 
of MurG is characteristic of domains that bind nucleotides (Branden & Tooze, 1998). 
Classical Rossman domains typically contain at least one conserved glycine rich motif, 
with the consensus sequence GXGXXG, located at a turn between the carboxyl end of 
one ?-strand and the amino terminus of the adjacent ?-helix (Baker et al: 9 1992). This 
motif is involved in binding the negatively charged phosphates (Carugo & Argos, 1997). 
There are three glycine rich loops (G loops) in E. coli MurG (Fig. 3a) that may. be 
variants on the phosphate binding loops found in other dinucleotide binding proteins (see 
below). • 

Sequence homology , 

Amino acid sequences for eighteen MurG homologs are now available. The sequence 
similarity between E. coli MurG and homologs from other bacterial strains ranges from 
less than 30% to more than 90% depending on the evolutionary relationship between the 
organisms. In all MurG homologs, however, there are several invariant residues. Fig. 3a 
shows a sequence alignment for a subset of MurG homologs with the invariant and 
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highly conserved residues indicated. These residues, which include the three G loops, 
have been highlighted in the E. coli MurG structure (Fig. 3b). Almost all of the invariant 
residues are located at or near the cleft between the two domains. Two of the G loops are 
found in the N domain (between N-?l/N-?l and N-?4/N-?4) and one is found in the C- 
domain (between C-?l/C-?l). The strict conservation of the highlighted residues among 
different bacterial strains, and their location as determined from the E. coli MurG 
structure, implicates them in substrate binding and catalytic activity. 

Structural homology reveals the donor binding site 

The three-dimensional backbone structure of E. coli MurG was compared to known 
protein structures, including the three other NDP-glycosyltransferase structures that have 
been reported (Charnok & Davies, 1999; Gastinel et at., 1999; Vrielink et at., 1994). 
The C-terminal domain was found to have significant structural homology (RMSD= 
2.218 A for 89 aligned C? atoms) to the C-terminal domain of phage T4 ?- 
glucosyltransferase (BGT), an enzyme that catalyzes the glucosylation of 
hydroxymethyl-cytosines in duplex DNA. A co-crystal structure of BGT with UDP 
bound in the C-terminal domain reveals the topology of the UDP binding pocket and also 
shows important contacts to the nucleotide (Morera et at., 1999; Vrielink et at., 1994). 
These contacts include: a) hydrogen bonds from the backbone amide of 1238 to the N3 
and 04 positions of the base; b) hydrogen bonds between the carboxyl side chain of 
E272 and the 02* and 03' hydroxy Is of the ribose ring; and c) contacts from a GGS 
motif in the loop following the first ?-strand of the C domain to the alpha phosphate of 
UDP. The structurally homologous C-domain of MurG contains a topologically similar 
pocket (Fig. 4a). Furthermore, even though the two domains share only 1 1% sequence 
identity overall, there are identical residues in the same spatial location in E: coli MurG 
and in BGT. Based on this comparison, we have concluded that the C-domain of E. coli 
MurG is the UDP-GlcNAc binding site. 

We have docked UDP-GlcNAc into the C-domain of E. coli MurG using the 
information on how UDP binds to BGT as a guide. As shown in Figure 4b, the uracil is 
held in place by contacts from the N3 and 04 atoms to the backbone amide of 1245. The 
02' and 03' hydroxyls on the ribose sugar are within hydrogen bonding distance of the 
invariant glutamate residue (E269) in the middle of helix C-?4. The conserved GGS 
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motif in G loop 3 is positioned to contact the alpha phosphate. When these contacts are 
made, the LTDP-GlcNAc substrate fits nicely into a pocket in the C-domain, where it is 
surrounded by many of the invariant residues identified through sequence analysis of 
other MurG homologs. It is possible to propose roles for some of these invariant 
residues from the model. For example, the side chain of R261 can be rotated to contact 
the second phosphate; this contact may help explain why LTDP binds significantly better 
to MurG than UMP. We propose that R261 plays an important role in catalysis by 
stabilizing the UDP leaving group via electrostatic interactions. The side chain of Q289 
is within hydrogen bonding distance of the C4 hydroxyl of the GlcNAc sugar: This 
contact may explain why MurG can discriminate between UDP-GlcNAc and its C4 axial 
isomer, UDP-GalNAc (Ha et al. y 1999). 

The acceptor binding site 

Structural considerations suggest that the primary acceptor binding site is located in the 
N-terminal domain of MurG. This domain contains three highly conserved regions, two 
of which are glycine-rich loops that face the cleft (Fig 3a arid 4c). Thesei G loops are 
reminiscent of the phosphate binding loops found in other nucleotide binding proteins, 
and are most likely involved in binding to the diphosphate on Lipid I. The N-termini of 
the helices following each G loop form opposite walls of a small pocket between the G 
loops. The helix dipoles create a positively charged electrostatic field in the pockefcthat 
can stabilize the negative charged diphosphates. When the diphosphate of the acceptor is 
anchored in the pocket created by the G-loops, the MurNAc sugar emerges into the cleft 
between domains and the C4 hydroxyl can be directed towards the anomeric carbon of 
the GlcNAc for attack on the face opposite the UDP leaving group. The third conserved 
region in the N domain spans the loop from the end of N-?5 to the middle of N-?5. 
Kinetic analysis of mutants is required to evaluate the roles of these residues (Ha et al. y 
1999; Men etal., 1998). 

Proposed membrane association site • 

MurG associates with the cytoplasmic surface of bacterial membranes where it couples a 
' soluble donor sugar to the membrane anchored acceptor sugar, Lipid L Analysis of the 
' E. coli MurG structure shows that there is a hydrophobic patch consisting of residues 
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175, L79, F82, W85 and Wl 16 in the N-domain, which is surrounded by basic residues 
(K72, K140, K69, R80, R86, R89). We propose that this is the membrane association 
site and that association involves both hydrophobic and electrostatic interactions with the 
negatively charged bacterial membrane. The location of this patch in MurG is also 
consistent with the proposed acceptor binding site: membrane association at this patch 
would bring the two N-terminal G loops close to the membrane surface where the 
diphosphate portion of the acceptor is located (Fig. 4c). Moreover, the cleft between the 
two domains would remain accessible, consistent with the biochemical requirement that 
the soluble UDP-GlcNAc donor be able to find its binding site from the cytoplasm. 

Implications for other glycosyltransferases 

Glycosyltransferases' that utilize an activated nucleotide sugar as a donor comprise a 
large family of enzymes in both prokaryotes and eukaryotes, and they play central roles 
in many important biological processes (Dennis et al y 1999; Koya et al., 1999; Verbert 
& Cacan v 1999). Glycosyltransferases are typically classified according to the nucleotide 
sugar they utilize, and it has frequently been noted that there is no significant sequence 
homology even among glycosyltransferases in the same family. This has made it 
difficult to identify common structural features and residues important in binding and 
catalysis. There are only three other glycosyltransferase structures available, and 
although none of them shows any sequence homology to MurG, a structural comparison 
indicates that one of them, BGT, contains a related donor binding site. 

In addition to this structural homology, we have identified a strikingly similar 
sequence motif in the MurG family and certain other UDP-glycosyltransferase families. 
This sequence motif spans about a thirty amino acid stretch in the C-domain of MurG 
and includes most of the invariant residues found in that domain. As shown in Figure 3a, 
a similar motif is found in the UDP-glucuronosyltransferases (Mackenzie, 1990). 
Certain residues are identical, including a number of prolines and glycines, and the 
spacing between them is invariant. This suggests that the UDP-glucuronosyltransferases 
contain a region of?/? supersecondary structure that is involved in a similar function as 
the corresponding region in MurG (Fig. 3c). This region binds the donor sugar. By 
analyzing the similarities and differences between the conserved residues in this 
subdomain in the MurG family and other UDP-glycosyltransferase families, it may be 
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possible to identify - and perhaps alter - residues that are involved in determining donor 
selectivity. We note that it would be useful to be able to manipulate donor specificity 
because it would extend the utility of glycosyltransferases as reagents for glycosylation 
of complex molecules. Altered glycosyltransferases could also be useful for remodeling 
cell surfaces and for probing the biological roles of particular carbohydrate structures. 

Conclusion 

This first structure of a member of the MurG family of glycosyltransferases lays the 
groundwork for further mechanistic and structural investigations, which may lead to the 
design of inhibitors and perhaps even new antibiotics. The work also shows that there 
can be conserved subdomains even in very different glycosyltransferase families. 
Information on conserved subdomains will be useful for structure prediction and may 
help guide experiments directed towards changing substrate specificity. 

EXAMPLE 2 

1 This example describes a method of isolating the C-terminal domain of the E. coli 
MurG protein, expressing the domain in E. coli cells and utilizing nuclear magnetic 
resonance (NMR) to determine the ability of compounds to bind. 

MurG can also be used to determine the ability of a chemical compound to bind 
to the C domain by a) determining the start of c domain based on the MurG crystal 
structure; b) independently expressing the C domain;; and c) using NMR methods to 
identify binding site and/or bound conformation of ligand. The same procedure is used 
for the acceptor binding domains. 

1 NMR methods are used to identify the protein binding sites nad screen for ligands 
that bind. The MurG C-terminal domain region of the protein has been expressed 
independently. The C domain has a much lower molecular weight than the full-length 
protein. Therefore, the expression of the C domain results in much sharper NMR peaks 
which will facilitate the NMR interpretation. Also the protin chemical shifts are very 
sensitive to their environment. Binding of a compound will introduce local environment 
• changes," thus changing the proton chemical shifts. In this way, residues involved in the 
binding can be differentiated easily from other amino acid residues not- involved in 
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binding a ligand. This method has also been used to identify Iigands that bind to low 
molecular weight drug targets (i.e., small proteins). ' 

Relevant references to NMR techniques are: Discovering high-affinity Iigands 
for proteins: SAR by NMR, S. Shuker, P. Hajduk, R. Meadows, and S. Fesik, Science 
274, 1531 (1996); Lin Y, Nageswara Rao BD. Structural characterization of adenine 
nucleotides bound to Escherichia coli adenylate kinase. 1. Adenosineconformations by 
proton two-dimensional transferred nuclear Overhauser effect spectroscopy. 
Biochemistry. 2000 Apr 4;39(13):3636-46; and Fejzo J, et at., Chem Biol 1999 
Oct;6(lO):755-69 (incorporated herein by reference). 

The SHAPES strategy is also useful for NMR identification of binding residues, 

Iigands and drug discover which is an NMR-based approach for lead generation in drug 

» 

discovery. Recently,' it has been shown that nuclear magnetic resonance (NMR) may be 
used to identify Iigands that bind to low molecular weight protein drugtargets. 
Recognizing the utility of NMR as a very sensitive method for detecting binding, we 
have focused on developing alternative approaches that are applicable to larger molecular 
weight drug targets and do not require isotopic labeling. A new method for lead 
generation (SHAPES) uses NMR to detect the binding of a limited but diverse library of 
small molecules to a potential drug target. The compound scaffolds are derived from 
shapes most commonly found in known therapeutic agents. NMR detection of low 
(microM-mM) affinity binding is achieved using either differential line broadening or 
transferred NOE (nuclear Overhauser effect) NMR techniques. The SHAPES method for 
lead generation by NMR is useful for identifying potential lead classes of drugs early in a 
drug design program, and is easily integrated with other discovery tools such as virtual 
screening, high-throughput screening and combinatorial chemistry. 



EXAMPLE 3 

This example describes the method of using the three-dimensional structure of E. 
coli MurG to determine the crystal structures of its mutant, enzyme-ligand complex, and 
MurG homologs, which share the same folding motif. 

First, a crystalline form of the new protein or the protein complex should be 
obtained. The E.coli MurG mutants should crystallized in a condition very similar to 
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what we have showed in the method section. The protein-ligand complex can be 
obtained by soaking the protein crystals in a ligand-containing buffer. Other MurG 
homologs can be expressed in a His-tagged fashion and purified using affinity colume. 
Presumably they can' be crystallized in a similar way using a detergent as the additive. 
Next, the diffraction data should be collected and processed. After the data collection, 
the molecular replacement method is used to determine the unknown structure. Either 
the whole E. coli MurG protein or one single domain can serve as a search model. This 
search model can be rotated and translated until the correct orientation is located in the 
unit cell of this unknown structure. The search model may only represent part of the 
contents of the asymmetric unit. However, the location of the first model is now already 
available. While the first location of the search model is fixed, the second round of 
translation search can be carried out to search more molecules or domains in the 
asymmetric unit cell. The phases from the final model generated by molecular 
replacement can be used to calculate the electron density map. Finally, a model is built 
based on the electron density map, and the model needs to be refined using program CNS 
orXPLOR. 



EXAMPLE 4 - 

.' * ■ * * ' - * ■ , v * 

" '" This example describes the method of using the three-dimensional coordinate 
structure of E. coli MurG to produce a protein fragment that can be usednn an NMR- 
based lead discovery program. The crystal structure reveals the boundaries of the C 
domain and permits us to design a gene containing only the C domain from the gene 
containing both domains: The C domain starting at residue 164 and ending at residue 
340 was cloned into an expression vector to generate a C-terminal His tag fusion. It was 
over-expressed in E.. coli cells and purified by affinity colume. The protein was shown to 
be monomeric by size exclusion chromatography and to be soluble at least up to 0.15 
mM, a concentration more than adequate for NMR analysis. G domains from other Murg 
homologues can be similarly expressed and used. 

' : . ■ EXAMPLE 5 - • 
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This example describes the co-crystallization of a MurG protein with a ligand. A 
MurG-ligand complex is formed by either co-crystallizing MurG protein with 
appropriate ligand or soaking the MurG crystals in buffers containing appropriate ligand. 
Co-crystallization is done by pre-mixing the protein sample with a certain amount of 
substrate or substrate analogs. Then the hanging drop method is used to produce crystals 
as described infra. 

Alternatively, ligans are incorporated into the crystals by soaking the protein 
crystals in the ligand containing buffer for a period of time to allow for infiltration into 
the crystal. The time ranges from a couple of hours to a couple of days. The 
concentration of ligand in the buffer ranges from several milimolar to several hundred 
mili molar. 



DEPOSIT OF COORDINATES 
The crystal structure three-dimensional coordinates of the E. coli MurG as set 
forth in Table 1 were deposited with the Protein Data Bank and have been assigned the 
indicated ID Code (Accession No.) 1F0K. 

Although the invention is described in detail with reference to specific 
embodiments thereof, it will be understood that variations which are functionally 
equivalent are within the scope of this invention. Indeed, various modifications of the 
invention in addition to those shown and described herein will become apparent to those 
skilled in the art from the foregoing description and accompanying drawings. Such 
modifications are intended to fall within the scope of the appended claims. » 

Various publications are cited herein, the disclosures of which are incorporated 
by reference in their entireties. 
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1. A composition comprising the £. coli MurG protein in crystalline form. 

2. A composition comprising a MurG protein in crystalline form. 

3. A three dimensional structure of the crystalline form of an E. coli MurG protein, 
wherein the three dimensional structure substantially conforms to the atomic coordinates 
represented in Table 1. 

4. A three dimensional structure of the crystalline form of a MurG protein, wherein 
the three dimensional structure substantially conforms to the atomic coordinates 
represented in Table I. 

5. A three dimensional structure of the a-carbon backbone of the crystalline form of 
an E. coli MurG protein, wherein the three dimensional structure substantially conforms 
to the atomic coordinates represented in Table 2. 

6. A three dimensional structure of the a-carbon backbone and conserved amino 
acid residues of an E .coli MurG protein, wherein the three dimensional structure 
substantially conforms to the atomic coordinates represented in Table 3. 

7. A three dimensional structure of a donor nucleotide binding site of a MurG 
protein wherein the three dimensional structure structure of the donor nucleotide binding 
site substantially conforms to the atomic coordinates in Table 4. 

8. The three dimensional structure of claim 7, wherein the donor nucleotide is 
UDP-GlcNAc. 

9. A three dimensional structure of an acceptor binding site of a MurG protein 
substantially conforming to the atomic coordinates in Table 5. 

10. A three dimensional structure of a membrane association site of a MurG protein 
substantially conforming to the atomic coordinates in Table 6. 

11. A three-dimensional computer image of the three-dimensional structure of a 
MurG protein. 

12. The image of claim 11, wherein the structure substantially conforms with the 
three-dimensional coordinates listed in Table 1. 

13. The image of claim 11, wherein the computer image is that is generated when a 
set of three-dimensional coordinates comprising the three-dimensional coordinates 
represented in Table 1 are analyzed on a computer using a graphical display software 
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program to create an electronic file of the image and visualizing the electronic file on a 
computer capable of representing the electronic file as a three-dimensional image. 

14. The image of claim 11, wherein the three-dimensional computer image is 
represented by a two dimensional image selected from the group consisting of Fig. 2a, 
3a, or 4c. 

15. The image of claim 11, wherein the three-dimensional computer image is used to 
design a compound. 

16. A three dimensional computer image of the three dimensional structure of the a- 
carbon backbone of a MurG protein. 

17. The image of claim 16, wherein the structure substantially conforms with the 
three-dimensional coordinates listed in Table 2. 

18. The image of claim 16, wherein the computer image is that, is generated when a 
set of three-dimensional coordinates comprising the three-dimensional coordinates 
represented in Table 2 are analyzed on a computer using a graphical display software 
program to create an electronic file of the image and visualizing the electronic file on- a 
computer capable of representing the electronic file as a three-dimensional image. • « 

19. The image of claim 16, wherein the three-dimensional computer image is used to 
design a compound. - 

20. A three dimensional image of the three dimensional image of an a-carbon 
backbone and conserved amino acid residues of a MurG protein. 

21. The image of claim 20, wherein the structure substantially conforms with the 
three-dimensional coordinates in Table 3. 

22. The image of claim 21, wherein the computer image is that is generated when a 
set of three-dimensional coordinates comprising the three-dimensional coordinates 
represented in Table 3 are analyzed on a computer using a graphical display software 
program to create an electronic file of the image and visualizing the electronic file on a 

. computer capable of representing the electronic file as a three-dimensional image. 
23 . The image of claim 21, wherein the three-dimensional computer image is used to 
design a compound. 

24. A three-dimensional computer image of the three-dimensional structure of a 
donor nucleotide binding site of a MurG protein. 
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25. The image of claim 24, wherein the structure substantially conforms with the 
three-dimensional coordinates in Table 4. 

26. The image of claim 24, wherein the computer image is that is generated when a 
set of three-dimensional coordinates comprising the three-dimensional coordinates 
represented in Table 4 are analyzed on a computer using a graphical display software 
program to create an electronic file of the image and visualizing the electronic file on a 
computer capable of representing the electronic file as a three-dimensional image. 

27. The image of claim 24, wherein the three-dimensional computer image is 
represented by a two dimensional image selected from the group consisting of Fig. 3c, 4a 
or 4b. 

28. The image of claim 24, wherein the three-dimensional computer image is used to 
design a compound. 

29. A three-dimensional computer image of the three-dimensional structure of an 
acceptor binding site of a MurG protein. 

30. The image of claim 29, wherein the structure substantially conforms with the 
three-dimensional coordinates Table 5. 

31. The image of claim 29, wherein the computer image is that is generated when a 
set of three-dimensional coordinates comprising the three-dimensional coordinates 
represented in Table 5 are analyzed on a computer using a graphical display software 
program to create an electronic file of the image and visualizing the electronic file on a 
computer capable of representing the electronic file as a three-dimensional image. 

32. The image of claim 29, wherein the three-dimensional computer image is 
represented by the two dimensional image of Fig. 4a. 

33. The image of claim 29, wherein the three-dimensional computer image is used to 
design a compound. 

34. A three-dimensional computer image of the three-dimensional structure of a 
membrane association site of a MurG protein. 

35. The image of claim 34, wherein the structure substantially conforms with the 
three-dimensional coordinates Table 6. 

36. The image of claim 34, wherein the computer image is that is generated when a 
set of three-dimensional coordinates comprising the three-dimensional coordinates 
represented in Table 6 are analyzed on a computer using a graphical display software 
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program to create an electronic Tile of the image and visualizing the electronic file on a 
computer capable of representing the electronic file as a three-dimensional image. 

37. The image of claim 34, wherein the three-dimensional computer image is 
represented by the two dimensional image of Fig. 4a. 

38. The image of claim 34, wherein the three-dimensional computer image is used to 
design a compound. 

39. A computer readable medium encoded with a set of three-dimensional 
coordinates of a MurG protein having a three-dimensional structure that substantially 
conforms to the atomic coordinates of Table 1, wherein using a graphical display 
software program, the three-dimensional coordinates create an electronic file that can be 
visualized on a computer capable of representing said electronic file as a three 
dimensional image. 

40. A computer readable medium encoded with a set of three-dimensional 
coordinates of an a-carbon backbone of a MurG protein having a three-dimensional 
structure that substantially conforms to the atomic coordinates of Table 2, wherein using 
a graphical display software program, the three-dimensional coordinates create an 
electronic file that can be visualized on a computer capable of representing said 
electronic file as a three dimensional image. 

41. A computer readable medium encoded with a set of three-dimensional 
coordinates of an a-carbon backbone and conserved amino acid residues of a MurG 
protein having a three-dimensional structure that substantially conforms to the atomic 
coordinates of Table 3, wherein using a graphical display software program, the three- 
dimensional coordinates create an electronic file that can be visualized on a computer 
capable of representing said electronic file as a three dimensional image. 

42. A computer readable medium encoded with a set ' of three-dimensional 
coordinates of a donor nucleotide binding site of a MurG protein having a three- 
dimensional structure that substantially conforms to the atomic coordinates of Table 4, 
wherein using a graphical display software program, the three-dimensional coordinates 
create an electronic file that can be visualized on a computer capable of representing said 
electronic file as a three dimensional image. 

43. A computer readable medium encoded with a set of three-dimensional 
coordinates of an acceptor binding site of a MurG protein having a three-dimensional 
structure that substantially conforms to the atomic coordinates of Table 5, wherein using 
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a graphical display software program, the three-dimensional coordinates create an 
electronic file that can be visualized on a computer capable of representing said 
electronic file as a three dimensional image... 

44. A computer readable medium encoded with a set of three-dimensional 
coordinates of a membrane association site of a MurG protein having a three- 
dimensional structure that substantially conforms to the atomic coordinates of Table 5, 
wherein using a graphical display software program, the three-dimensional coordinates 
create an electronic file that can be visualized on a computer capable of representing said 
electronic file as a three dimensional image. 



45. A method for identifying a potential inhibitor of a UDP-glycosyltransferase enzyme, 
the method comprising the steps of: 

a. using a three-dimensional structure of UDP-glycosyltransferase enzyme as defined by 
atomic coordinates of UDP-glycosyltransferase enzyme according to FIG. 5; 

b. employing said three-dimensional structure to design or select said potential inhibitor; 

c. synthesizing said potential inhibitor; and 

d. contacting said potential inhibitor with said UDP-glycosyltransferase enzyme in the 
presence of a substrate to test the ability of said potential inhibitor to inhibit said UDP- 
glycosyltransferase enzyme. 

46. The method according to claim 45, wherein said potential inhibitor is selected from a 
database. 

47. The method according to claim 45, wherein said potential inhibitor is designed de 
novo. 

48. The method according to claim 45, wherein said potential inhibitor is designed from 
a known inhibitor. 

49. The method according to claim 45, wherein said step of employing said three- 
dimensional structure to design or select said potential inhibitor comprises the steps of: 

a. identifying chemical entities or fragments capable of associating with UDP- 
glycosyltransferase enzyme; and 

b. assembling the identified chemical entities or fragments into a single molecule to 
provide the structure of said potential inhibitor. 
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50. The method according to claim 45, wherein the potential inhibitor is a competitive 
inhibitor of mutant UDP-glycosyltransferase enzyme. 

51. The method according to claim 45, wherein said potential inhibitor is a non- 
competitive or uncompetitive inhibitor of mutant UDP-glycosyltransferase enzyme. 

52. A model of a UDP-glycosyltransferase, wherein the model represents a three- 
dimensional structure that substantially conforms to the atomic coordinates of Table I. 

53. The model of claim 52, wherein the structure substantially conforms to the 
atomic coordinates and B-values represented by Table 1. 

54. The model of claim 52, wherein the structure is monomeric. 

55. The model of claim 52, wherein at least about 50% of the structure has an 
average root-mean-square deviation (RMDS) of less than about 2.5 A for backbone 
atoms in secondary structure elements in each domain of the structure. 

56. The model of claim 52, wherein the MurG protein comprises an amino acid t 
sequence that is at least about 25% identical to the amino acid sequence of the £. coli 
MurG protein. 

57. The model of claim 52, wherein the MurG protein comprises an amino acid 
sequence that is at least about 40% identical to the amino acid sequence of the E. coli 
MurG protein. 

58. The model of claim 52, wherein the MurG protein comprises an amino acidr 
•sequence that is at least about 60% identical to the amino acid sequence of the E. coli 

MurG protein. 

59. The model of claim 52, wherein the MurG protein comprises an amino , acid 
sequence selected, from the group consisting of the amino acid sequence of a MurG 
protein from Escherichia coli, Bacillus subtilis, Aquefex aeolicus, Borrelia burgdorferi, 
Chlamydia pneumoniae, Chlamydia trachomatis. Enterococcus faecais, Enterococcus 
hirae, Haemophilus influenzae, Helicobacter pylori J99, Helicobacter pylori, 
Mycobacterium . tuberculosis, Porphyromonas • gingivalis, Rickettsia 
prowazekii,Streptomyces coelicolor, Streptomyces collinus. Streptococcus pneumoniae, 
Synechocystis sp. (strain PCC6803), Thermotoga maritime, and Treponema pallidum, a 

' mutant of any of the amino acid sequences, and a variants of any of the amino acid 
sequences. 

205 

SUBSTITUTE SHEET (RULE 26) 



WO 01/90301 




PCT/USO 1/1 1500 



60. The model of claim 52, wherein the MurG protein comprises an amino acid 
sequence selected from the group consisting of the amino acid sequences of MurG 
proteins as deposited in the NCBI database and identified with Accession Nos. 
CAB51993, A71316, E70579, C71699, F70195, A43727, JC1275, BVECMG, 
CEECAM, 083535, Q9ZK59, CAB85280, AAF39O20, BAA 18775, AAD26629, 
CAB73295, P37585, Q9ZHA9, Q9ZHDC0, Q9ZBA5, Q9X4H4, Q9WY74, P74657, 
006224, Q9Z702, 084766, 069552, )67238, 051708, 025770, 007670, 007109, 
P45065, CAB66324, AAC68356, AAF06830, P18579, P17443, P17952, P16457, 
P07862, 'AAE23178, AAD53936, CAA18668, CAA38869, CAA38868, CAA38867, 
CAA38866, AAD08196, BAA01453, BAA01455, BAA01454, AAD19042, CAA45558, 
CAA74235, AAD10537, AAD06652, AAC95450, CAA14869, AAC73201, AAC65509, 
AAC67113, AAC45636, CAB08640, AAC22793, AAC07193, BAA24357, CAB13395, 
BAA01355, AAB35538, 1904153C, 1808265B, 1808265A, CAA36866, CAA36869, 
CAA36868, CAA36867, CAA36776, and AAA99436. 

61. The model of claim 52, wherein the MurG protein comprises an amino acid 
sequence obtained from an organism selected from the group consisting of bacteria, 
small pathogenic organisms, cyano bacteria, higher-order bacteria, spirochetes and 
thermal stable bacteria. 

62. The model of claim 52, wherein the MurG protein comprises an amino acid 
sequence obtained from an organism selected from the group consisting of Escherichia 
coli. Bacillus subtilis, Aquefex aeolicus, Borrelia burgdorferi. Chlamydia pneumoniae, 
Chlamydia trachomatis, Enterococcus faecais, Enterococcus hirae, Haemophilus 
influenzae, Helicobacter pylori J99, Helicobacter pylori, Mycobacterium tuberculosis, 
Porphyromonas gingivalis, Rickettsia pro\vazekii,Streptomyces coelicolor, Streptomyces 
collinus, Streptococcus pneumoniae, Synechocystis sp. (strain PCC6803), Thermotoga 
maritime, and Treponema pallidum. 

63. The model of claim 52, wherein the MurG protein is a structural homologue of 
the E. coli MurG protein. 

64. The model of claim 52, wherein the structure comprises an n-terminal and C- 
terminal domain connected by a covalent peptide linker, and wherein each domain has an 
alpha/beta fold. 

65. The model of claim 52, wherein the RMSD is less than 2.5 A over at least 80 
aligned C-alpha atoms in each domain. 
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66. The model of claim 52, wherein the N-terminai domain comprises two glycine 
rich loops. 

67. The model of claim 66, wherein the amino acid sequence of the two glycine rich 
loops comprises GGTGGH and G-GGYVSG. 

68. The model of claim 52, wherein the C-terminal. domain comprises one glycine 
rich loop. 

69. The model of claim 68, wherein the glycine rich loop comprises the amino acid 
sequence GGSQGAR or GGS-GAR. 

70. The model of claim 52, wherein the atomic coordinates are generated by the 
method comprising the steps of: 

(a) providing a MurG protein in crystalline form; 

(b) generating an electron-density map of the crystalline MurG protein; and 

(c) analyzing the electron-density map to produce the atomic coordinates. 

71. The model of claim 70, wherein the crystalline MurG protein is produced by a 
method comprising the steps of: 

(a) combining MurG protein with UDP-GlcNAc ,and 

(b) inducing crystal formation to produce said crystalline MurG protein. 

72. The model of claim 70, wherein the crystalline MurG protein is produced by- the 
hanging drop method in which MurG in buffer is at a concentration of at least 5 ug/ml 
and is combined with a reservoir solution and crystallizes. 

73. The model of claim 72, wherein the buffer has a pH range from about 6:5 to 
about 9.0, and a buffer concentration range from about 10 mM to about 200 mM. 

74. The model of 73, wherein the buffer is a Tris or a Hepes buffer, having a pH from 
about 7.0 ig abouf'8.5. 

75. The model of 74, wherein the buffer has a pH of about 7:9. - • 

76. The model of claim 73, wherein the buffer further comprises at least one salt, 
chelating agent, or reducing agent. ... 

77. The model of claim 72, wherein the reservoir solution has a pH range from about 
5.0 to about 9.0 and the buffer concentration ranges from about 10 mM to about 1M. 

78. The model of claim 77, wherein the reservoir solution further comprises at least 
one suitable precipitant, a detergent, and a reducing agent. 

79. The model of claim 78, wherein the reservoir solution comprises a NaMES or 
sodium citrate buffer having a pH from about 6.0 to about 7.0. 
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80. The model of claim 79, wherein the buffer has a pH of about 6.5. 

81. The model of claim 78, wherein the precipitant is selected from the group 
consisting of ammonium sulfate and sodium potassium tartrate. 

82. The model of 78, wherein the detergent is TritonX-100. 

83. The model of 78, wherein the reducing agent is DTT, DTE or beta- 
mercaptoethanol. 

84. The model of claim 71, wherein the MurG protein and the UDP-GlcNAc are in a 
1:3 molar ratio. 

85. The model of claim 71, wherein the buffer comprises 0.1 M NaMES, pH6.5, 
0.9M (NH4)2S0 4 , 0.4% TRITON X-100®, and 10 mM dithiothreitol (DTT). 

86. The model of claim 71, wherein the step of generating an electron-density map 
comprises analyzing the crystalline MurG protein by X-ray diffraction. 

87. The model of claim 70, wherein the model is a computer image generated by a 
computer-readable medium encoded with a set of three-dimensional coordinates of the 
three-dimensional structure, wherein, using a graphical display software program, the 
three-dimensional coordinates create an electronic file that can be visualized on a 
computer capable of representing the electronic file as a three-dimensional image. 

88. A model of a donor nucleotide binding site of a UDP-glycosyltransferase (MurG) 
protein, wherein the model represents a three-dimensional structure that substantially 
conforms to the atomic coordinates of Table 4. 

89. The model of claim 88, wherein the donor nucleotide binding site is located 
within the MurG C-terminal domain. 

90. The model of claim 88, wherein the structure substantially conforms to the 
atomic coordinates* and B-values of Table 4. 

91. The model of claim 88, wherein at least about 50% of the structure has an 
average root-mean-square (RMSD) of less than about 2.5 A for the conserved amino acid 
residues for the donor nucleotide binding site of the E. coli MurG. 

92. The model of claim 88, wherein the donor nucleotide binding site comprises an 
amino acid sequence that is at least about 70% identical to the conserved amino acid 
residues of the donor nucleotide binding site of E. coli MurG. 

93. The model of claim 88, wherein the donor nucleotide binding site comprises an 
amino acid sequence that is at least about 80% identical to the conserved amino acid 
residues of the donor nucleotide binding site of the E. coli MurG. 
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94. The model of claim 88, wherein the donor nucleotide binding site comprises an 
amino acid sequence that is at least about 90% identical to the conserved amino acid 
residues of the donor nucleotide binding site of the E. coli MurG. 

95. The model of claim 88, wherein the donor nucleotide binding site comprises an 
amino acid sequence that is at least about 95% identical to the conserved amino acid 
residues of the donor nucleotide binding site of the E. coli MurG. 

96. The model of claim 88, wherein the atomic coordinates are generated by a 
method comprising the steps of: 

a) providing a Murg protein in a crystalline form: 

b) generating an electron-density map of said crystalline MurG protein; and 

c) analyzing the electron-density map to produce the atomic coordinates. 

97. The model of claim 88, wherein the model is a computer image generated by a 
computer-readable medium encoded with a set of three-dimensional coordinates of the 
three-dimensional structure, wherein, using a graphical display software program, the 
three-dimensional coordinates create an electronic file that can be visualized on a 
computer capable of representing the electronic file as a three-dimensional image. 

98. A model of an acceptor binding site of a UDP-glycosyltransferase (MurG) * 
protein, wherein the model represents a three-dimensional structure that substantially- 
conforms to the atomic coordinates of Table 5 . 

99.. The model of claim 98, wherein the structure substantially conforms to the- 
atomic coordinates and B-values of Table 5. 

100. The model of claim 98, wherein at least about 50%' of the structure has an 
average root-mean-square (RMSD) of less than about 1.5A for the conserved amino acid 
residues in the acceptor binding site. 

101. The model of claim 98, wherein the acceptor binding site comprises an amino 
acid sequence that is at least about 70% identical to the conserved amino a:cid residues of 
the acceptor binding site of E. coli MurG. . , 

102. the model of claim 9'8, wherein the acceptor binding site comprises an amino 
acid sequence that is at least about 80% identical to the conserved amino acid residues of 
the acceptor binding site of E. coli MurG. 

103. The model of claim 98, wherein the acceptor binding site comprises an amino 
acid sequence that is at least about 90% identical to the conserved amino acid residues of 
the E. coli MurG. 
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104. The model of claim 98, wherein the acceptor binding site comprises an amino 
acid sequence that is at least about 95% identical to the conserved amino acid residues of 
the acceptor binding site of the E. coli MurG. 

105. The model of claim 98, wherein the acceptor binding site comprises an amino 
acid sequence that is at least about 70% identical to the amino acid sequence selected 
from the group consisting of Escherichia coli, Bacillus subtilis, Aquefex aeolicus, 
Borrelia burgdorferi, Chlamydia pneumoniae, Chlamydia trachomatis, Enter ococcus 
faecais, Enterococcus hirae, Haemophilus influenzae, Helicobacter pylori J99, 
Helicobacter pylori, Mycobacterium tuberculosis, Porphyromonas gingivalis, Rickettsia 
prowazekii,Streptomyces coelicolor, Streptomyces collinus, Streptococcus pneumoniae, 
Synechocystis sp. (strain PCC6803), Thermotoga maritime, and Treponema pallidum. 

106. The model of claim 98, wherein the atomic coordinates are generated by the 
method comprising the steps of: 

a) providing a MurG protein in a crystalline form: 

b) generating an electron-density map of said crystalline MurG protein; and 

c) analyzing the electron-density map to produce the atomic coordinates. 

107. The model of claim 98, wherein the model is a computer image generated by a 
computer-readable medium encoded with a set of three-dimensional coordinates of the 
three-dimensional structure, wherein, using a graphical display software program, the 
three-dimensional coordinates create an electronic file that can be visualized on a 
computer capable of representing the electronic file as a three-dimensional image. 

108. A model of a membrane association site of a UDP-glycosyltransferase (MurG) 
protein, wherein the model represents a three-dimensional structure that substantially 
conforms to the atdrnic coordinates of Table 6. 

109. The model of claim 108, wherein the structure substantially conforms to the 
atomic coordinates and B-values of Table 4. 

110. The model of claim 108, wherein at least about 50% of the structure has an 
average root-mean-square (RMSD) of less than about 1.5 A for conserved amino acid 
residues in the E. coli membrane association site. 

111. The model of claim 108, wherein the membrane association site comprises an 
amino acid sequence that is at least about 70% identical to the conserved amino acid 
residues of the membrane association site of E. coli MurG. 
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112. The model of claim 108, wherein the membrane association site comprises an 
amino acid sequence that is at least about 80% identical to the conserved amino acid 
residues of the membrane association site of the E. coli MurG. 

113. The model of claim 108, wherein the membrane association site comprises an 
amino acid sequence that is at least about 90% identical to the conserved amino acid 
residues of the membrane association site of the E. coli MurG. 

114. The model of claim 108, wherein the membrane association site comprises an 
amino acid sequence that is at least about 95% identical to the conserved amino acid 
residues of a membrane association site of the E. coli MurG. 

115. The model of claim 108, wherein the membrane association site comprises an 
amino acid sequence that is at least about 70% identical to the amino acid sequence from 
organisms selected from the group consisting of Escherichia coli, Bacillus subtilis, 
Aquefex aeolicus; Borrelia burgdorferi, Chlamydia pneumoniae. Chlamydia 
trachomatis, Enterococcus faecais, Enterococcus hirae. Haemophilus influenzae, 
Helicobacter pylori J99, Helicobacter pylori, Mycobacterium tuberculosis, 
Porphyromonas gingivalis, Rickettsia prowazekii,Streptomyces coelicolor, Streptomyces 
collinus, Streptococcus pneumoniae, Synechocystis sp. (strain PCC6803), Thermotoga 
maritime, and Treponema pallidum. 

' 116. The model of claim 108, wherein the model is a computer image generated by a 
computer-readable medium encoded with a set of three-dimensional coordinates of the 
three-dimensional structure, wherein, using a graphical display software program, the 
three-dimensional coordinates create an electronic file that can be visualized on a 
computer capable of representing the electronic file as a three-dimensional jmage. 
117: A computer-assisted method of structure based drug design, of bioactive 
compounds, comprising the steps of: 

(a) providing a model of a UDP-glycosyltransferase (MurG)protein or a 
donor nucleotide binding site, acceptor binding site or membrane association site; 
and * • : 

(b) designing a chemical compound using said model. ..... 

118. The method of claim 117, further comprising the step of synthesizing the 
chemical compound. •-*•♦.-.•.« 
.119.' The method of claim 1 18; further comprising - the step : . of. evaluating the 
bioactivity of the synthesized chemical compound. • <■ 
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120. The method of claim 118, wherein the model of the UDP-glycosy (transferase 
(MurG) protein represents a three-dimensional structure comprising the atomic 
coordinates listed in Table I . 

121. The method of claim 1 18, wherein the model of the donor nucleotide binding site 
represents a three-dimensional structure comprising the atomic coordinates Table 4. 

122. The method of claim 118, wherein the model of the acceptor binding site 
represents a three-dimensional structure comprising the atomic coordinates in Table 5. 

123. The method of claim 118, wherein the model of the membrane association site 
represent a three-dimensional structure comprising the atomic coordinates in Table 6. 

124. The method of claim 118, wherein the model comprises a computer image 
generated when the atomic coordinates listed in Table 1 are analyzed on a computer 
using a graphical display software program to create an electronic file of the image and 
visualizing the electronic file on a computer capable of representing the electronic file as 
a three-dimensional image. 

125. The method of claim 1 18, wherein the step of designing comprises computational 
screening of one or more databases of chemical compounds in which the three 
dimensional structure of said compounds are known. 

126. The method of claim 125, further comprising interacting a compound identified 
by the screening step with the model by computer. 

127. The method of claim 1 18, wherein the step of designing comprises directed drug 
design. 

128. The method of claim 1 18, wherein the step of designing comprises random drug 
design. 

129. The method of claim 1 18, wherein the step of designing comprises grid-based 
drug design. 

130. The method of claim 118, wherein the step of designing comprises selecting 
compounds which are predicted to mimic the three-dimensional structure of the three- 
dimensional structure of the MurG protein. 

131. The method of claim 118, wherein the step of designing comprises selecting 
compounds which are predicted to bind to the three-dimensional structure of the MurG 
protein. 

132. The method of claim 118, wherein the bioactivity is selected from the group 
consisting of inhibiting binding of a nucleotide donor compound to the MurG protein, 
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inhibiting binding of an acceptor compound to the MurG protein, or inhibiting 
association of the MurG Protein to a membrane. 

133. A model of the three dimensional structure of a MurG protein , wherein the 
model is produced by the following method comprising the steps of: 

(a) providing an amino acid sequence of a MurG protein and the amino acid 
sequence of the Escherichia coli MurG protein; 

(b) identifying structurally conserved regions shared between the the MurG : 
protein and the E. coli MurG protein; and 

(c) determining atomic coordinates for the MurG protein by assigning the 
structurally conserved regions of the MurG protein to a three dimensional structure using 
a three dimensional structure of the MurG protein which substantially conforms to the 
atomic coordinates represented, in Table 1, to derive a model of the three dimensional 
structure of the MurG protein amino acid sequence. 

134. The model of claim 133, wherein the MurG protein amino acid sequence 
comprises the sequence of an amino acid sequence selected from the group consisting of 
the aminoVacid sequences of MurG proteins as deposited in the NCBI database and 
identified with Accession Nos. CABS 1993, A71316, E70579, C71699, F70195, A43727, 
JC1275, BVECMG, CEECAM, 083535, Q9ZK59, CAB85280, AAF3902O, BAA18775, 
AAD26629, CAB73295, P37585, Q9ZHA9, Q9ZHDC0, Q9ZBA5, Q9X4H4, Q9WYJ4, 
P74657, 006224, Q9Z702, 084766, 069552, )67238, 051708, O25770, 007670, 
007109, P45065, CAB66324, AAC68356, AAF06830, P18579, P17443, P17952, 
P16457, P07862, AAE23178, AAD53936, CAA18668, CAA38869, CAA38868, 
CAA38867, CAA38866, AAD08196, BAA01453, BAA01455, BAA01454, AAD19042, 
CAA45558, CAA74235, AAD10537, AAD06652, AAC95450, CAA14869, AAC73201, 
AAC65509, AAC671 13, AAC45636 , CAB08640, AAC22793, AAC07193, BAA24357, 
CAB13395, BAA01355, AAB35538, 1904153C, 1808265B, 1808265A, CAA36866, 
CAA36869, CAA36868, CAA36867, CAA36776, and AAA99436. • 

135. A composition for inhibiting the activity of a glycosyltransferase comprising a 
compound that inhibits the activity of a glycosyltransferase, wherein the compound is 
identified by the method comprising the steps of: . 
• (a) providing a three-dimensional structure of a MurG protein; . 
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(b) using the three-dimensional structure of the MurG protein to design a 
chemical compound that inhibits activity of a glycosyltransferase; 

(c) synthesizing the chemical compound; and 

(d) evaluating the ability of the chemical compound to inhibit the activity of a 
glycosyltransferase. 

136. e composition of claim 135, wherein the glycosyltransferase is a MurG protein. 

137. The composition of claim 135, wherein the three-dimensional structure of the 
MurG protein substantially conforms to atomic coordinates represented by Table 1. 

138. The composition of claim 135, wherein the compound is selected from the group 
consisting of an inorganic and an organic compound. 

139. The composition of claim 135, wherein the compound is a substituted pyrimidine 
analogs 

140. The composition* of claim 135, wherein the compound is selected from the group 
consisting of an analog of a MurG protein, a substrate analog of a MurG protein, a donor 
molecule analog of a MurG protein, and a membrane analog of a MurG protein. 

141. The composition of claim 135, further comprising a component selected from the 
group consisting of an excipient an adjuvant, and a carrier. 

142. A composition for stimulating the activity of a glycosyltransferase comprising a 
compound that stimulates the activity of a glycosyltransferase, wherein the compound is 
identified by the method comprising the steps of: 

(a) providing a three-dimensional structure of a MurG protein; 

(b) using the three-dimensional structure of the MurG protein to design a 
chemical compound that inhibits activity of a glycosyltransferase; 

(c) synthesizing the chemical compound; and 

(d) evaluating the ability of the chemical compound to stimulate the activity 
of a glycosyltransferase. 

143. A method to determine a three-dimensional structure of a MurG protein 
comprising the steps of: 

(a) providing an amino acid sequence of a MurG protein, wherein the three- 
dimensional structure of the MurG protein is not known; 

(b) analyzing the pattern of folding of the amino acid sequence in a three- 
dimensional conformation by fold recognition; and 
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(c) comparing the pattern of folding of the MurG protein amino acid 
sequence with the three dimensional structure of the E. coli MurG protein, wherein the 
three-dimensional structure of the E. coli ..MurG protein substantially conforms to the 
atomic coordinates represented. in Table L 

144. A method to derive a model of the three-dimensional structure of a MurG protein 
comprising the steps of: 

(a) providing an amino acid sequence of a MurG protein; 

(b) identifying structurally conserved regions shared between the MurG 
protein and the E. coli MurG protein; 

(c) determining atomic coordinates for the MurG protein structure by 
assigning the structurally conserved regions of the MurG protein to a three-dimensional 
structure using a three dimensional structure of the E. coli MurG protein based on atomic 
coordinates represented in Table 1 to derive a model of the three dimensional structure of 
the MurG protein amino acid sequence. 

145. The method of claim 144, further comprising assigning atomic coordinates for 
side chains of said MurG protein by determining sterically allowable positions using a 
library of rotamers. 

146.. A method to derive a three dimensional structure of a crystallized MurG protein 
comprising the steps of: 

(a) comparing the Patterson function of a crystallized MurG protein with the 
Patterson function of crystalline E. coli MurG protein to produce an electron-density 
map of the crystallized MurG protein; and 

.(b) analyzing the electron-density map to produce the three dimensional 
structure of the crystallized MurG protein. 

147. The method of claim 146, further comprising the step of rotating the Pattersor 
function of the crystallized MurG protein on the Patterson function of the crystalline E 

. coli MurG protein to determine the correct orientation of the crystallized MurG proteir 
in a crystal of said crystallized MurG protein to identify the initial phases of tht 
crystallized MurG protein. 

148. The method of claim 146, further comprising the step of electronically 
stimulating the three dimensional structure of the crystallized MurG protein to derive \ 
computer image of the three dimensional structure of the crystallized MurG protein. 
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Group V, Claim 10, drawn to the 3-D structure of the membrane association site of the MurG protein. 

Group VI, Claims 11-19, drawn to the computer image of the 3-D MurG protein. 

Group VII, Claims 20-23, drawn to the 3-D image of the 3-D image of an aipha-carbon backbone. 

Group VIII, Claims 2+-2S, drawn to the 3-D computer image of the 3-D structure of a donor nucleotide of a donor 

nucleotide binding site of the MurG protein. 

Group IX, Claims 29-33, drawn to the 3-D computer image of a 3-D structure of an acceptor binding site of the MurG 
protein. 

Group X. Claims 3+-3S. drawn to the 3-D computer image of the 3-D structure of a membrane association site of the 
MurG protein. 

Group XI. Claims, 39-+ 1. drawn to the computer readable medium encoded with a set of 3-D coordinates of a MurG 
protein. 

Group XII, Claim +2, drawn to the computer readable medium encoded with a set of 3-D coordinates of a donor 
nucleotide binding site of a MurG protein. 

Group XIII. Claim 43, drawn to the computer readable medium with a set of 3-D coordinates of an acceptor binding 
site of the MurG protein. 

Group XIV, Claim +4, drawn to a computer readable medium encoded with 3-D coordinates of a membrane association 
site of the MurG protein. 

Group XV. Claims +5-51. drawn to a method for identifying a potential inhibitor of a UDP-glycotransferase enzvme. 
Group XVI, Claims 52-S7, drawn to a model of UDP-glycotransferase where the model represents a 3-D structure. 
Group XVII, Claims 88-97, drawn to a model a donor nucleotide binding site of a UDP-glycosyltransferase protein. 
Group XVIII, Claims 98-107, drawn to a model of an acceptor binding site of a UDP-glycotransferase protein. 
Group XIX, Claims 108-116, drawn to a model of a membrane association site of a UDP-glycotransferase protein. 
Group XX, Claims 117-132, drawn to a computer-assisted method of structure based drug design of bioreative 
compounds. 

Group XXI, Claims 133-13+, drawn to a model of a 3-D structure of a MurG protein. 
Group XXII, Claims 135-1 + 1, drawn to a composition for inhibiting the activity of a glycotransferase. 
Group XXIII, Claim 1+2. drawn to a composition for stimulating the activity of glycotransferase. 
Group XXIV, Claims 1+3-1+S, drawn to a method to determine the 3-D structure of a MurG protein. 

This International Searching Authority considers the international application does not comply with tlie 
requirements of unity of invention (Rules 13.1, 13.2 and 13.3) for the reasons indicated below: Tlie inventions lifted in 
Groups 1-2+ do not relate a single inventive concept under PCT Rule 13.1 because, under PCT Rule 13.2, they lack 
the same or corresponding special technical features for the following reasons: the special technical feature for Group 1 
is that the MurG is being claimed in its crystalline form. In Group II flu- special technical feature is that tin- 
structure of the crystalline form of the MurG protein is being claimed. Group Ill's special technical feature is that the 
Group is claiming the 3-D structure of the binding site of MurG. In Group IV the special technical feature emerges as 
the 3-D structure of the acceptor binding site of tlie MurG protein. Group V embraces the special technical feature of 
the 3-D structure of a membrane association site In Group VI the special technical feature is that of the 3-D 
computer intake of the 3-D structure of the MurG protein. Group VII has the special technical feature of a J-D 
structure of a 3-D structure of an alpha-carbon backbone of the MurG protein. Group VIII deals with the special 
technical feature of the 3-D computer image of the 3-D structure of a donor nucleotide binding site of the instant 
protein. Group IX involves the special technical feature of the 3-D computer image* of the 3-D structure of an acceptor 
binding site. Group X develops tlie special technical feature of the 3-D computer image of the ^-D structure of a 
membrane association site of the MurG protein. In Group XI the subject of the special technical feature of a computer 
readable medium encoded within a set of 3-D coordinates of the MurG protein are described. Group XII addresses the 
technical feature of a computer readable medium that is encoded with a set of 3-D coordinates of a donor nucleotide 
binding site of the MurG protein. In Group XIII is drawn to the special technical feature of a computer readable 
medium with a 3-D set of coordiantes to accommodate the acceptor binding site of a MurG protein. Group IV 
involves the special technical feature of a computer readable medium that is encoded with a set of 3-D coordinates that 
is used for the membrane association site of the MurG protein. Group V is involved with the special technical feature 
of a method for identifying a potential inhibitor a UDP-glycotransferase enzyme. The special technical feature of 
Group XVI is that of a 3-D model of UDP-glycotransferase. Group XVII addresses the special technical feature of a 
model of a donor nucleotide binding site of UDP-glycotransferase. Group XVIII addresses the special technical feature 
of the model of an acceptor binding site of a UDP glycotransferase. Group XIX consists of the special technical 
feature of a model of tlie membrane association site of a UDP-glycotransferase. In Group XX we see that the special 
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technical feature is that of a computer-assisted method of structure based drug design. In Group XXI the special 
technical feature that emerges is that of a model of a 3-D structure of a MurG protein. Group XXII deals with the 
special technical feature of a composition for inhibiting the activity of a gl ycotransferase. Group XXIII addresses the 
special technical feature of a composition for the stimulation of activity of a glycotransferase. Initially Group XXIV 
speaks to the special technical feature of a method to determine the 3-D structure of the MurG protein. 
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